Abstract. We prove two finite dimensional approximation results and a symplectic
non-squeezing property for the Korteweg-de Vries (KdV) flow on the
circle T. The nonsqueezing result relies on the aforementioned approximations 
and the finite-dimensional nonsqueezing theorem of Gromov [13]. Unlike the 
work of Kuksin [21] which initiated the investigation of non-squeezing results 
for infinite dimensional Hamiltonian systems, the nonsqueezing argument here 
does not construct a capacity directly. In this way our results are similar 
to those obtained for the NLS flow by Bourgain [3]. A major difficulty here 
though is the lack of any sort of smoothing estimate which would allow us to 
easily approximate the infinite dimensional KdV flow by a finite-dimensional 
Hamiltonian flow. To resolve this problem we invert the Miura transform and 
work on the level of the modified KdV (mKdV) equation, for which smoothing 
estimates can be established. 
1. Introduction

This paper is concerned with the symplectic behavior of the Korteweg-de Vries 
(KdV) flow 

(1.1)    $u_t + u_{xxx} = 6uu_x; \qquad u(0,x) = u_0(x)$

on the circle $x \in \mathbb{T} := \mathbb{R}/2\pi\mathbb{Z}$, where $u(t,x)$ is real-valued. In particular we
investigate how the flows may (or may not) be accurately approximated by certain
finite-dimensional models, and then use such an approximation to conclude a
symplectic non-squeezing property. In order to describe the symplectic space involved,
and state the result precisely, we need to set notation and recall some previous
results describing the well-posedness of the initial value problem (1.1).

On the circle we have the spatial Fourier transform

(1.2)    $\hat u(k) := \frac{1}{2\pi} \int_0^{2\pi} u(x) \exp(-ikx)\, dx$

for all $k \in \mathbb{Z}$, and the spatial Sobolev spaces

$\|u\|_{H^s} := (2\pi)^{1/2} \|\langle k \rangle^s \hat u\|_{\ell^2_k}$

for $s \in \mathbb{R}$, where $\langle k \rangle := (1 + |k|^2)^{1/2}$. These are natural spaces for analyzing the
KdV flow.
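
As a quick check of these normalizations, here is a worked example that is not part of the original text. It assumes the Sobolev norm is the weighted $\ell^2$ norm in $k$ of the Fourier coefficients (1.2), and takes the single mode $u(x) = \cos(Nx)$ with an integer $N \ge 1$:

```latex
% Worked example (illustrative): u(x) = cos(Nx), N >= 1 an integer.
\begin{align*}
\hat u(\pm N) &= \frac{1}{2\pi}\int_0^{2\pi} \cos(Nx)\, e^{\mp iNx}\, dx = \frac12,
  \qquad \hat u(k) = 0 \ \text{ for } k \neq \pm N, \\
\|u\|_{H^s} &= (2\pi)^{1/2} \Big( 2 \cdot \langle N \rangle^{2s} \cdot \tfrac14 \Big)^{1/2}
  = \pi^{1/2}\, \langle N \rangle^{s}.
\end{align*}
% For s = 0 this recovers the L^2 norm: (\int_0^{2\pi} \cos^2(Nx)\,dx)^{1/2} = \pi^{1/2}.
% For s = -1/2 the norm decays like N^{-1/2}, so amplitudes of size N^{1/2}
% on the mode cos(Nx) stay bounded in H^{-1/2}.
```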

Let $P_0$ denote the mean operator

$P_0 u := \frac{1}{2\pi} \int_{\mathbb{T}} u$

or equivalently

$\widehat{P_0 u}(k) = \chi_{k=0}\, \hat u(k).$

The KdV flow is mean-preserving, and it will be convenient to work in the case
when $u$ has mean zero $^1$. Accordingly we define the mean-zero periodic Sobolev
spaces $H^s_0$ by

$H^s_0 := \{u \in H^s_x : P_0 u = 0\}$

endowed with the same norm as $H^s$.

Recent work on the local and global well-posedness theory in $H^s_0$ for (1.1) is basic
to our results here. For example, the geometric conclusions from finite-dimensional 
Hamiltonian dynamics which we ultimately need for our nonsqueezing result can 
only be applied in the setting of rather rough solutions to the initial value problem 
(1.1). We now pause to summarize some of the analytical techniques that have 
been developed for the study of such rough solutions, and the resulting regularity 
theory (see e.g. [1], [18], [6], and [8], [9]). 

1.1. Summary of local and global well-posedness theory. If the initial datum
$u_0$ for (1.1) is smooth, then there is a global smooth solution $^2$ $u(t)$ (see e.g. [26]). We
can thus define the non-linear flow map $S_{KdV}(t)$ on $C^\infty(\mathbb{T})$ by $S_{KdV}(t)u_0 := u(t)$.
In particular this map is densely defined on every Sobolev space $H^s_0$.

If $s \ge -1/2$, then the equation (1.1) is globally well-posed in $H^s_0$. In other
words, the flow map $S_{KdV}(t)$ is uniformly continuous (indeed, it is analytic) on $H^s_0$
for times $t$ restricted to a compact interval $[-T, T]$, and for such $s$ we have bounds
of the form

(1.3)    $\sup_{|t| \le T} \|S_{KdV}(t)u_0\|_{H^s_0} \le C(s, T, \|u_0\|_{H^s_0})$

(see [18], [8], [9] (and also Section 9.1 below)). For $s < -1/2$ the flow map $S_{KdV}(t)$
is no longer uniformly continuous [6] (see also [19]) or analytic [4], so from the
point of view which requires a uniformly continuous flow in time, the Sobolev space
$H^{-1/2}_0$ is the endpoint space for the KdV flow. Coincidentally, this space is also
a natural phase space for which KdV becomes a Hamiltonian flow; we will have
more to say about this at the end of the introduction. Note however that if one
asks only that the flow be continuous in time, then global well-posedness for (1.1)
has been established for all $s \ge -1$ in [16] using inverse scattering methods. Combining
mapping properties of the Miura transform and the result in [27], local
well-posedness of (1.1) in $H^s_0$ with a (not uniformly) continuous flow map holds for
$-5/8 < s < -1/2$.

To obtain many of the local and global well-posedness results mentioned above,
one iterates in a certain spacetime Banach space $Y^s$ (defined in (3.1) below; this
space is a variant of the $X^{s,b}$ spaces used for instance in [1], [18]), which has the
same regularity as $H^s$ in the sense that one has the embedding $^3$

The nonlinearity is then placed in a companion space $Z^s$ (see (3.2) below), which
is related to $Y^s$ via an energy estimate of the form

$\|\eta(t)u\|_{Y^s} \lesssim \|u(t_0)\|_{H^s} + \|u_t + u_{xxx}\|_{Z^s},$

for any time $t_0$, and any bump function $\eta$ supported near $t_0$. (We will elaborate
more upon these spaces and estimates in Section 3.) The local well-posedness
theory $^4$ for the KdV equation (1.1) then hinges on the bilinear estimate

(1.4)    $\|(uv)_x\|_{Z^s} \lesssim \|u\|_{Y^s} \|v\|_{Y^s}$

whenever $u, v$ are mean-zero functions and $s \ge -1/2$ (see [18], [8], [9]).

$^1$ One can easily pass from the mean zero case to the general mean case by a Galilean
transformation $u(t,x) \mapsto u(t, x - P_0(u)t) - P_0(u)$.

$^2$ This result can also be obtained by inverse scattering methods, since the KdV equation is
completely integrable. However, our methods here do not use inverse scattering techniques,
although the special algebraic structure of KdV (in particular, the Miura transform [24]) is certainly
exploited.

To pass from local well-posedness to global well-posedness one needs to obtain
long-time bounds on the $H^s_0$ norm. For $-1/2 \le s < 0$, this has been achieved by
means of the "I-method", constructing an almost conserved quantity comparable
to the $H^s$ norm; see [8], [9], or Section 9.1.



1.2. Low frequency approximation of KdV. The KdV flow (1.1) is, formally 
at least, a Hamiltonian flow on an infinite-dimensional space. In order to rigor- 
ously apply results from symplectic geometry, we must approximate this infinite- 
dimensional flow by a finite-dimensional flow. Furthermore, in order to apply these 
geometric tools, we need that the finite-dimensional flow is itself Hamiltonian. 

We begin with a negative result. Suppose that we wish to study the KdV flow
for data $u_0$ whose Fourier transform is supported on $[-N, N]$ for some large fixed
$N$, and specifically to approximate the KdV flow by a finite-dimensional model. A
first guess for such a model might be the flow

(1.5)    $u_t + u_{xxx} = P_{\le N}(6uu_x); \qquad u(0) = u_0,$

where $P_{\le N}$ is the Fourier projection to frequencies $\le N$:

$\widehat{P_{\le N}u}(k) = \chi_{|k| \le N}\, \hat u(k).$

Denote the flow map associated to (1.5) by $S_{P_{\le N}KdV}(t)$. This flow has several
advantageous properties; for instance, $S_{P_{\le N}KdV}(t)$ is a symplectomorphism on the
space $P_{\le N}H^{-1/2}_0$, associated with a natural symplectic structure (see next subsection).
Since $P_{\le N}H^{-1/2}_0$ is a finite dimensional space, it is easy to see (e.g. using $L^2$
norm conservation and Picard iteration) that this flow $S_{P_{\le N}KdV}$ is globally smooth
and well-defined. In [3], the NLS flow $iu_t + u_{xx} = |u|^2 u$ was similarly truncated,
and it was shown that the truncated flow was a good approximation to the original
(infinite dimensional) flow. Unfortunately, the same result does not apply for KdV:

$^3$ In this paper we use $A \lesssim B$ to denote an estimate of the form $A \le CB$, where the implicit
constant $C$ may depend on certain parameters such as $s$ which we will specify later in the paper.
Similarly, $A \ll B$ denotes $B \ge CA$ for some such universal constant $C$.

$^4$ Strictly speaking, in order to handle large initial data one must also generalize this estimate
to circles $\mathbb{R}/2\pi\lambda\mathbb{Z}$ of arbitrarily large period, in order to apply rescaling arguments to make the
data small again. See [8], [9], or Section 9.1.
Theorem 1.1. Let $k_0 \in \mathbb{Z}^*$, $T > 0$, $A > 0$. Then for any $N > C(A, T, k_0)$ there
exists initial data $u_0$ with $\|u_0\|_{H^{-1/2}_0} \le A$ and $\mathrm{supp}(\hat u_0) \subseteq \{|k| \le N\}$ such that

(1.6)    $|\widehat{S_{KdV}(T)u_0}(k_0) - \widehat{S_{P_{\le N}KdV}(T)u_0}(k_0)| \ge c(T, A, k_0)$

for some $c(T, A, k_0) > 0$.

In other words, $S_{P_{\le N}KdV}$ does not converge to $S_{KdV}$ even in a weak topology.

We prove this negative result in Section 8. Basically, the problem is that the
multiplier $\chi_{[-N,N]}$ corresponding to $P_{\le N}$ is very rough, and this creates significant
deviations between $S_{KdV}$ and $S_{P_{\le N}KdV}$ near the Fourier modes $k = \pm N$. In
cubic equations such as mKdV (see (1.9) below) or the cubic nonlinear Schrödinger
equation, these deviations would stay near the high frequencies $\pm N$, but in the
quadratic KdV equation these deviations create significant fluctuations near the
frequency origin, eventually leading to failure of weak convergence in (1.6).
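
The frequency bookkeeping behind this heuristic can be written out explicitly. The following is only a sketch, using the Fourier convention (1.2) and the model nonlinearities $uu_x$ and $v^2 v_x$; the mode labels $k_1, k_2, k_3$ are introduced here for illustration and do not appear in the original.

```latex
% Products become convolution sums under the convention (1.2).
\begin{align*}
\widehat{u u_x}(k) &= \tfrac12\, \widehat{(u^2)_x}(k)
  = \frac{ik}{2} \sum_{k_1 + k_2 = k} \hat u(k_1)\, \hat u(k_2), \\
\widehat{v^2 v_x}(k) &= \tfrac13\, \widehat{(v^3)_x}(k)
  = \frac{ik}{3} \sum_{k_1 + k_2 + k_3 = k} \hat v(k_1)\, \hat v(k_2)\, \hat v(k_3).
\end{align*}
% Quadratic case: inputs with k_1 \approx N and k_2 \approx -N produce an
%   output frequency k = k_1 + k_2 \approx 0.
% Cubic case: if every |k_j| \approx N, then k = \pm N \pm N \pm N lies near
%   \pm N or \pm 3N, and never near the frequency origin.
```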

Of course there are several obvious ways to modify the finite-dimensional flow
(1.5) in an attempt to find an effective approximation to the KdV flow for data with
Fourier transform supported on $[-N, N]$, but at least a little bit of care is needed
when considering these modifications. We let $b(k)$ be the restriction to the integers
of a real even bump function adapted to $[-N, N]$ which equals 1 on $[-N/2, N/2]$,
and consider the evolution

(1.7)    $u_t + u_{xxx} = B(6uu_x); \qquad u(0) = u_0$

where

$\widehat{Bu}(k) = b(k)\hat u(k).$

Let $S_{BKdV}$ denote the flow map associated to (1.7). Observe that this is a finite-dimensional
flow on the space $P_{\le N}H^s_0$. Unfortunately, $S_{BKdV}$ is not a symplectomorphism,
but we will explain in (1.27) below how by conjugating a flow of the
form (1.7) with a simple multiplier operator we will arrive at our desired finite
dimensional symplectomorphism on $P_{\le N}H^{-1/2}_0(\mathbb{T})$ that well-approximates the full
KdV flow at low frequencies. This desired symplectomorphism is labelled $S^{(N)}_{KdV}(t)$
in (1.27) below $^5$, and once the aforementioned approximation properties are established,
the nonsqueezing result will follow almost immediately after quoting the
finite dimensional nonsqueezing result of Gromov [13].

The first step in the argument is to show we can approximate $S_{KdV}$ by $S_{BKdV}$
in the strong $H^s_0$ topology:

Theorem 1.2. Fix $s \ge -1/2$, $T > 0$, and $N \ge 1$. Let $u_0 \in H^s_0$ have Fourier
transform supported in the range $|k| \le N$. Then

$\sup_{|t| \le T} \|P_{\le N^{1/2}}(S_{BKdV}(t)u_0 - S_{KdV}(t)u_0)\|_{H^s_0} \le N^{-\sigma} C(s, T, \|u_0\|_{H^s_0})$

for some $\sigma = \sigma(s) > 0$.

In particular, we can accurately model the KdV evolution for band-limited initial
data by a finite-dimensional flow, at least for frequencies $|k| \le N^{1/2}$.

The well-posedness statement (1.3) gives Theorem 1.2 for all $0 < N \le C(s, T, \|u_0\|_{H^s_0})$,
hence our proof needs only to consider $N > C(s, T, \|u_0\|_{H^s_0})$. This turns
out to be the most interesting case from the point of view of the nonsqueezing
applications of this approximation theorem which we take up below.

$^5$ The equation which defines this flow is given in (7.1) below.

Theorem 1.2 can be viewed as a statement that one can (smoothly) truncate 
the KdV evolution at the high frequencies without causing serious disruption to 
the low frequencies, in spite of the obstruction posed by Theorem 1.1. Our second 
main result (proven in Section 5) is in a similar vein: 

Theorem 1.3. Fix $s \ge -1/2$, $T > 0$, $N \ge 1$. Let $u_0, \tilde u_0 \in H^s_0$ be such that
$P_{\le 2N}u_0 = P_{\le 2N}\tilde u_0$ (i.e. $u_0$ and $\tilde u_0$ agree at low frequencies). Then we have,

$\sup_{|t| \le T} \|P_{\le N}(S_{KdV}(t)u_0 - S_{KdV}(t)\tilde u_0)\|_{H^s_0} \le N^{-\sigma} C(s, T, \|u_0\|_{H^s_0}, \|\tilde u_0\|_{H^s_0})$

for some $\sigma = \sigma(s) > 0$.

By the same reasoning made following Theorem 1.2, we may assume in the proof
of Theorem 1.3 that $N > C(s, T, \|u_0\|_{H^s_0}, \|\tilde u_0\|_{H^s_0})$.

The point of Theorem 1.3 is that changes to the initial data at frequencies $\ge 2N$
do not significantly affect the solution at frequencies $\le N$, as measured in the strong
$H^s_0$ topology. This is in stark contrast to the negative result in Theorem 1.1. The
point is that there is some delicate cancellative structure in the KdV equation which
permits the decoupling of high and low frequencies, and this structure is destroyed
by projecting the KdV equation crudely using (1.5).

To prove Theorem 1.2 and Theorem 1.3, we shall need to exploit the subtle
cancellation mentioned in the previous paragraph in order to avoid the obstructions
arising from Theorem 1.1. We do not know how to do this working directly with
the KdV flow. Rather, we are able to prove estimates which explicitly account for
this subtle structure in KdV by using the Miura transform $u = Mv$, defined by

(1.8)    $u = Mv := v_x + v^2 - P_0(v^2).$

As discovered in [24], this transform allows us to conjugate the KdV flow to the
modified Korteweg-de Vries (mKdV) flow

(1.9)    $v_t + v_{xxx} = F(v); \qquad v(x,0) = v_0(x)$

where the non-linearity $F(v)$ is given by

(1.10)    $F(v) := 6(v^2 - P_0(v^2))v_x.$

The modified KdV equation has slightly better smoothing properties $^6$ than the ordinary
KdV equation, and in addition the process of inverting the Miura transform
adds one degree of regularity (from $H^{-1/2}_0$ to $H^{1/2}_0$). In particular, the types of
counterexamples arising in Theorem 1.1 do not appear in the mKdV setting, and
by proving a slightly more refined trilinear estimate than those found in e.g. [9]
(see in particular Theorem 4.3 below) we are able to prove the above two theorems
by passing to the mKdV setting using the Miura transform. Of course, in order to
close the argument we will need some efficient estimates on the invertibility of the
Miura transform; we set up these estimates (which may be of independent interest)
in Section 2.

$^6$ See Section 4, in particular Theorem 4.3.



1.3. Application to symplectic non-squeezing. We can apply the above approximation
results to study the symplectic behavior of KdV in a natural phase
space $H^{-1/2}_0(\mathbb{T})$. Before doing so, we recall some context and results from previous
works. We are following here especially the exposition from [15, 22].

Definition 1.4. Consider a pair $(\mathbb{H}, \omega)$ where $\omega$ is a symplectic form $^7$ on the Hilbert
space $\mathbb{H}$. We say $(\mathbb{H}, \omega)$ is the symplectic phase space of a PDE with Hamiltonian
$H[u(t)]$ if the PDE can be written in the form,

(1.11)    $\dot u(t) = J \nabla H[u(t)].$

Here $J$ is an almost complex structure $^8$ on $\mathbb{H}$, which is compatible with the
Hilbert space inner product $\langle \cdot, \cdot \rangle$. That is, for all $u, v \in \mathbb{H}$,

(1.12)    $\omega(u, v) = \langle Ju, v \rangle.$

The notation $\nabla$ in (1.11) denotes the usual gradient with respect to the Hilbert
space inner product,

(1.13)    $\langle v, \nabla H[u] \rangle = dH[u](v)$

(1.14)    $\qquad = \frac{d}{d\varepsilon}\Big|_{\varepsilon = 0} H[u + \varepsilon v].$

One easily checks that an equivalent way to write the PDE corresponding to the
Hamiltonian $H[u(t)]$ in $(\mathbb{H}, \omega)$ is

(1.15)    $\dot u(t) = \nabla_\omega H[u(t)]$

where the symplectic gradient $\nabla_\omega H[u]$ is defined in analogy with (1.13),

(1.16)    $\omega(v, \nabla_\omega H[u]) = dH[u](v).$

For example, on the Hilbert space $H^{-1/2}_0(\mathbb{T})$, we can define the symplectic form

(1.17)    $\omega_{-1/2}(u, v) := \int_{\mathbb{T}} u(x)\, \partial_x^{-1} v(x)\, dx$

where $\partial_x^{-1} : H^{-1/2}_0(\mathbb{T}) \to H^{1/2}_0(\mathbb{T})$ is the inverse to the differential operator $\partial_x$
defined via the Fourier transform by

$\widehat{\partial_x^{-1} f}(k) := \frac{1}{ik} \hat f(k).$

$^7$ That is, a nondegenerate, antisymmetric form $\omega : \mathbb{H} \times \mathbb{H} \to \mathbb{C}$. We identify in the usual way
$\mathbb{H}$ and its tangent space $T_x\mathbb{H}$ for each $x \in \mathbb{H}$.

$^8$ That is, a bounded, anti-selfadjoint operator with $J^2 = -(\text{identity})$.
The KdV flow (1.1) is then formally the Hamiltonian equation in $(H^{-1/2}_0(\mathbb{T}), \omega_{-1/2})$
corresponding to the (densely defined) Hamiltonian

(1.18)    $H[u] := \int_{\mathbb{T}} \frac{1}{2} u_x^2 + u^3\, dx.$

Indeed, working formally $^9$ we have for any $v \in H^{-1/2}_0(\mathbb{T})$,

$\frac{d}{d\varepsilon}\Big|_{\varepsilon = 0} H[u + \varepsilon v] = \int_{\mathbb{T}} u_x v_x + 3u^2 v\, dx$
$\qquad = \int_{\mathbb{T}} (-u_{xx} + 3u^2) v\, dx$
$\qquad = \int_{\mathbb{T}} \partial_x^{-1}(-u_{xxx} + 6uu_x) v\, dx$
$\qquad = -\int_{\mathbb{T}} (-u_{xxx} + 6uu_x) \partial_x^{-1} v\, dx$
$\qquad = \omega_{-1/2}(u_{xxx} - 6uu_x, v)$
$\qquad = \omega_{-1/2}(v, -u_{xxx} + 6uu_x).$

Comparing (1.15)-(1.16) with (1.1), we see KdV is indeed the Hamiltonian PDE
corresponding to $H[u]$ on the infinite dimensional symplectic space $(H^{-1/2}_0, \omega_{-1/2})$. In
particular, the flow maps $S_{KdV}(t)$ are, formally, symplectomorphisms on $H^{-1/2}_0(\mathbb{T})$.
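
Two steps of the formal computation above use that the inverse derivative is anti-selfadjoint on mean-zero functions, and hence that the form (1.17) is antisymmetric. A short sketch of why, under the assumption (made here only for illustration) that $f$ and $g$ are smooth mean-zero functions on the circle with periodic primitives $F$ and $G$:

```latex
% Assumption: f, g smooth and mean-zero on T; F := \partial_x^{-1} f and
% G := \partial_x^{-1} g are periodic with F_x = f and G_x = g.
\begin{align*}
\int_{\mathbb{T}} (\partial_x^{-1} f)\, g \, dx
  &= \int_{\mathbb{T}} F\, G_x \, dx
   = -\int_{\mathbb{T}} F_x\, G \, dx
   = -\int_{\mathbb{T}} f\, \partial_x^{-1} g \, dx, \\
\omega_{-1/2}(g, f)
  &= \int_{\mathbb{T}} g\, \partial_x^{-1} f \, dx
   = -\int_{\mathbb{T}} f\, \partial_x^{-1} g \, dx
   = -\omega_{-1/2}(f, g).
\end{align*}
% The periodic integration by parts produces no boundary term.
% Note also that \partial_x^{-1}(6 u u_x) = 3u^2 - P_0(3u^2); the constant
% P_0(3u^2) drops out when integrated against the mean-zero function v.
```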

That the KdV flow arises as a Hamiltonian flow from a symplectic structure as
described above was discovered by Gardner and Zakharov-Faddeev (see [12, 31]).
A second structure was given by Magri [23] using $\int u^2\, dx$ as Hamiltonian, but it
is not as convenient as the first structure for our strategy to prove nonsqueezing.
Roughly speaking, it seems the symplectic form in this second structure could
possibly be used to establish a nonsqueezing property, in the $H^{-3/2}_0$ topology,
of a finite dimensional analog of (1.1). However, since the well-posedness theory,
and the accompanying estimates, for the full KdV flow do not presently exist at
such rough norms, we do not see how we could approximate the full KdV flow
in a space as rough as $H^{-3/2}_0$ with a finite dimensional flow. The first structure
described above allows us to adopt this strategy in the space $H^{-1/2}_0$, within which
we do have well-posedness. (See below for references for this approach to proving
nonsqueezing for PDE. See e.g. [25, 11] for more details and history of the various
symplectic structures for KdV.)

For any $u_* \in H^{-1/2}_0(\mathbb{T})$, $r > 0$, $k_0 \in \mathbb{Z}^*$, and $z \in \mathbb{C}$, we consider the
infinite-dimensional ball

$B^\infty(u_*; r) := \{u \in H^{-1/2}_0(\mathbb{T}) : \|u - u_*\|_{H^{-1/2}_0} \le r\}$

and the infinite-dimensional cylinder

$C^\infty_{k_0}(z; r) := \{u \in H^{-1/2}_0(\mathbb{T}) : |k_0|^{-1/2} |\hat u(k_0) - z| \le r\}.$

$^9$ By the word 'formally', we mean here that no attempt is made to justify various
differentiations or integration by parts. Later, when we localize the space $H^{-1/2}_0$ and Hamiltonian in
frequency and write down the corresponding equations, the reader can carry out the analogous
computation where the justification of the necessary calculus will be evident.
The final result of this paper is the following symplectic non-squeezing theorem.

Theorem 1.5. Let $0 < r < R$, $u_* \in H^{-1/2}_0(\mathbb{T})$, $k_0 \in \mathbb{Z}^*$, $z \in \mathbb{C}$, and $T > 0$. Then

$S_{KdV}(T)(B^\infty(u_*; R)) \not\subseteq C^\infty_{k_0}(z; r).$

In other words, there exists a global $H^{-1/2}_0(\mathbb{T})$ solution $u$ to (1.1) such that

$\|u(0) - u_*\|_{H^{-1/2}_0} \le R$

and

$|k_0|^{-1/2} |\hat u(T)(k_0) - z| > r.$

Note that no smallness conditions are imposed on $u_*$, $R$, $z$, or $T$.

Roughly speaking, this Theorem asserts that the KdV flow cannot squash a large
ball into a thin cylinder. Notice that the balls and cylinders can be arbitrarily far
away from the origin, and the time $T$ can also be arbitrary. Note though that
this result is interesting even for $u_* = 0$, $z = 0$ and smooth initial data $u_0$, as
it tells us that the flow cannot at any time uniformly squeeze the ball $B^\infty(0; R)$
even at a fixed frequency $k_0$. By Theorem 1.5, the well-posedness theory for
KdV reviewed above, and density considerations, we know that for any $T$, $r < R$,
there will be some initial data $u_0 \in B^\infty(0; R)$ for which $^{10}$ $|\hat u(k_0, T)| > |k_0|^{1/2} r$.
(See [5], page 96 for the same discussion in the context of a nonlinear Klein-Gordon
equation.) A second immediate application of Theorem 1.5 to smooth solutions was
highlighted in a different context already in [21], namely that such smooth solutions
of (1.1) cannot uniformly approach some asymptotic state: for any neighborhood
$B^\infty(u_0; R)$ of the initial data in $H^{-1/2}_0(\mathbb{T})$ and for any time $t$, the diameter of the
set $S_{KdV}(t)(B^\infty(u_0; R))$ cannot be less than $R$.

The motivation for Theorem 1.5, and an important component of its proof, is
the finite-dimensional nonsqueezing theorem of Gromov [13] (see also subsequent
extensions in [14], [15]). The extension to the infinite-dimensional setting provided
by a nonlinear PDE seems nontrivial. The program was initiated by Kuksin [21],
[22] for certain equations where the nonlinear flow is a compact perturbation of the
linear flow. That the KdV equation doesn't meet this requirement can be seen by
an argument involving simple computations similar to those supporting Theorem
1.1 which are detailed in Section 8 below: fix $\sigma \ll 1$ and for each integer $N \ge 1$
consider initial data,

$u_{0,N}(x) := \sigma N^{1/2} \cos(Nx).$

Clearly the set $\{u_{0,N} : N = 1, 2, \ldots\}$ is bounded in $H^{-1/2}_0$. However, when one
computes the second iterate $^{11}$ one sees that it differs from the linear evolution
of $u_{0,N}$ at frequency $k = N$ in that,

(1.19)    $\hat u^{[2]}_N(N, t) - \hat u^{[0]}_N(N, t) \sim N^{1/2} \sigma^3 e^{iN^3 t}.$

By the local well-posedness theory we know, assuming $\sigma$ is sufficiently small compared
to $t$, that the difference between the second iterate and the actual nonlinear
evolution $u_N(t)$ of the data $u_{0,N}$ satisfies,

(1.20)    $\|u_N(t) - u^{[2]}_N(t)\|_{H^{-1/2}_0} \lesssim \sigma^4.$

Together, (1.19) and (1.20) show that if $\{N_k\}$ is a sequence of integers relatively
prime to one another $^{12}$, then

$\hat u_{N_k}(N_l, t) - \hat u^{[0]}_{N_k}(N_l, t) \sim \delta_{k,l} \cdot \sigma^3 \cdot N_k^{1/2} e^{iN_k^3 t}.$

Hence the set $\{u_{N_k}(t) - u^{[0]}_{N_k}(t)\}$ has no limit point in $H^{-1/2}_0(\mathbb{T})$.

$^{10}$ We are using here the statement of the Theorem only in the case $u_* = 0$, $z = 0$. Of course
one gets a similar conclusion to the one we draw here, but with different weights and a different
initial data set, by simply using the $L^2$ conservation and time reversibility properties of the flow.
That is, for any $R > r$, there is data $u_0 \in \{\|f\|_{L^2(\mathbb{T})} \le R\}$ such that the evolution $u$ of this data
satisfies $|\hat u(k_0, T)| > r$.

$^{11}$ See in particular equation (8.2) for the notation used here, and if necessary Section 8 for
what we hope is a sufficiently detailed discussion to allow the reader to reproduce the elementary
computations we quote here.

The nonsqueezing results of Kuksin were extended to certain stronger nonlinearities
by Bourgain [3, 5]; for instance [3] treats the cubic nonlinear
Schrödinger flow on $L^2(\mathbb{T})$. In these works, the full solution map is shown to be
well-approximated by a finite dimensional flow constructed by cutting the solution
off to frequencies $|k| \le N$ for some large $N$. The nonsqueezing results in [3, 5]
follow then from a direct application of Gromov's finite dimensional nonsqueezing
result to this approximate flow.

The argument we follow here for the KdV flow is similar to the work in [3, 5], 
but seems to require a bit more care. The complication seems to us to be somehow 
rooted in the counterexample of Theorem 1.1, which clearly exhibits that a sharp 
cut-off is not appropriate in constructing the approximating flow, but which seems 
also to be subtly related to the fact that the estimates necessary to approximate 
the full KdV flow by a more gradually truncated flow are unavailable to us when 
we work directly with the KdV equation. We have already sketched how we will 
deal with this difficulty (that is, by passing to the modified KdV equation) in the 
discussion which followed Theorem 1.3 above. 

We now provide some details of the previous paragraph's sketch, in particular 
we indicate the difficulties that arise when one tries to repeat the argument in [3, 5]. 

Let $N \ge 1$ be an integer. By simply restricting the form $\omega_{-1/2}$, the space
$(P_{\le N}H^{-1/2}_0(\mathbb{T}), \omega_{-1/2})$ is a $2N$-dimensional real symplectic space and hence by general
arguments (see e.g. Proposition 1 in [15]) is symplectomorphic to the standard
space $(\mathbb{R}^{2N}, \omega_0)$. We will make explicit use of such an equivalence below: any
$u \in P_{\le N}H^{-1/2}_0(\mathbb{T})$ is determined completely by

(1.21)    $(\mathrm{Re}(\hat u(1)), \ldots, \mathrm{Re}(\hat u(N)), \mathrm{Im}(\hat u(1)), \ldots, \mathrm{Im}(\hat u(N)))$
$\qquad =: (e_1(u), \ldots, e_N(u), f_1(u), \ldots, f_N(u)) \in \mathbb{R}^{2N}.$

$^{12}$ Note (for example by examining the iterates and using well-posedness) that $\hat u_N(t)$ is
supported only at frequencies which are integer multiples of $N$.
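In terms of the coordinates (1.21) the form $\omega_{-1/2}$ defined in (1.17) can be written
using the Plancherel theorem as,

$\omega_{-1/2}(u, v) = \sum_{k = -N,\, k \ne 0}^{N} \hat u(-k) \frac{1}{ik} \hat v(k)$
$\qquad = \sum_{k=1}^{N} \frac{1}{ik} \big( \hat u(-k)\hat v(k) - \hat u(k)\hat v(-k) \big)$
$\qquad = \sum_{k=1}^{N} \frac{2}{k}\, \mathrm{Im}\big( \overline{\hat u(k)}\, \hat v(k) \big)$
$\qquad = \sum_{k=1}^{N} \frac{2}{k} \big( e_k(u) \cdot f_k(v) - e_k(v) \cdot f_k(u) \big).$

Write $\Gamma$ for the $N \times N$ matrix $\Gamma = \mathrm{diag}(1, \frac{1}{\sqrt 2}, \frac{1}{\sqrt 3}, \ldots, \frac{1}{\sqrt N})$, $\Lambda = \mathrm{diag}(\Gamma, \Gamma)$, and
$u = (e(u), f(u)) \in \mathbb{R}^{2N}$ for the coordinates in $P_{\le N}H^{-1/2}_0(\mathbb{T})$; we summarize the
discussion above by saying,

(1.22)    $\omega_{-1/2}(u, v) = \omega_0(\Lambda(e(u), f(u)), \Lambda(e(v), f(v))),$

where as before we've written $\omega_0$ for the standard symplectic form on $\mathbb{R}^{2N}$. In other
words,

$\Lambda : (P_{\le N}H^{-1/2}_0(\mathbb{T}), \omega_{-1/2}) \to (\mathbb{R}^{2N}, \omega_0)$

is a symplectomorphism.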
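
The passage from Fourier coefficients to the real coordinates of (1.21) rests on pairing the modes $k$ and $-k$. A minimal sketch of that algebra for a single $k \ge 1$, assuming only that $u, v$ are real-valued and that $\hat u(k) = e_k(u) + i f_k(u)$ as in (1.21):

```latex
% Since u, v are real-valued, \hat u(-k) = \overline{\hat u(k)} and likewise for v.
\begin{align*}
\frac{1}{ik}\Big( \hat u(-k)\hat v(k) - \hat u(k)\hat v(-k) \Big)
  &= \frac{1}{ik}\Big( \overline{\hat u(k)}\,\hat v(k)
       - \overline{\overline{\hat u(k)}\,\hat v(k)} \Big)
   = \frac{2}{k}\, \mathrm{Im}\big( \overline{\hat u(k)}\,\hat v(k) \big), \\
\overline{\hat u(k)}\,\hat v(k)
  &= \big( e_k(u) - i f_k(u) \big)\big( e_k(v) + i f_k(v) \big) \\
  &= e_k(u)e_k(v) + f_k(u)f_k(v)
     + i\big( e_k(u) f_k(v) - e_k(v) f_k(u) \big).
\end{align*}
% The first line uses w - \overline{w} = 2i\,\mathrm{Im}(w).
```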

Following [3], our goal is to find a flow which satisfies three conditions: it should
be finite dimensional, that is, map $P_{\le N}H^{-1/2}_0(\mathbb{T})$ into itself; it should be a symplectic
map for each $t$; and it should well-approximate the full flow $S_{KdV}(t)$ in a sense that
we will make rigorous momentarily. For now, we write $S^{(N)}_{Good}(t)$ for this yet to be
determined flow.

(1.23)
$\begin{array}{ccc}
(P_{\le N}H^{-1/2}_0, \omega_{-1/2}) & \xrightarrow{\ \Lambda\ } & (\mathbb{R}^{2N}, \omega_0) \\
\downarrow S^{(N)}_{Good}(t) & & \downarrow \\
(P_{\le N}H^{-1/2}_0, \omega_{-1/2}) & \xrightarrow{\ \Lambda\ } & (\mathbb{R}^{2N}, \omega_0)
\end{array}$

Note then that the map,

(1.24)    $\Lambda \circ S^{(N)}_{Good}(t) \circ \Lambda^{-1} : (\mathbb{R}^{2N}, \omega_0) \to (\mathbb{R}^{2N}, \omega_0)$

is likewise a symplectomorphism to which we can apply the finite dimensional
theory of symplectic capacity (see [13], and e.g. [15]). One defines, for any
$x_* \in \mathbb{R}^{2N}$, $u_*^{(N)} \in P_{\le N}H^{-1/2}_0(\mathbb{T})$, $r > 0$, $0 < |k_0| \le N$, and $z \in \mathbb{C}$, the
finite-dimensional balls in $P_{\le N}H^{-1/2}_0(\mathbb{T})$, $\mathbb{R}^{2N}$, respectively, by the notation,

(1.25)    $B^N(u_*^{(N)}; r) := \{u^{(N)} \in P_{\le N}H^{-1/2}_0(\mathbb{T}) : \|u^{(N)} - u_*^{(N)}\|_{H^{-1/2}_0} \le r\}$

(1.26)    $B(x_*, r) := \{x \in \mathbb{R}^{2N} : |x - x_*| \le r\}.$

and the finite-dimensional cylinders in the same spaces by,

$C^N_{k_0}(z; r) := \{u^{(N)} \in P_{\le N}H^{-1/2}_0(\mathbb{T}) : |k_0|^{-1/2} |\widehat{u^{(N)}}(k_0) - z| \le r\}$
$C_{k_0}(z; r) := \{(e, f) \in \mathbb{R}^{2N} : |(e_{k_0} + \sqrt{-1}\, f_{k_0}) - z| \le r\}.$

From [13] (see also e.g. Theorem 1, Page 55 in the exposition [15]) we have the
finite-dimensional analogue of Theorem 1.5:

Theorem 1.6 ([13]). Assume that for some $R, r > 0$, $z \in \mathbb{C}$, $0 < k_0 \le N$, $x_* \in \mathbb{R}^{2N}$
there is a symplectomorphism $\phi$ defined on $B(x_*, R) \subset (\mathbb{R}^{2N}, \omega_0)$ so that

$\phi(B(x_*, R)) \subseteq C_{k_0}(z; r).$

Then necessarily $r \ge R$.

We apply this theorem to the symplectomorphism $\Lambda \circ S^{(N)}_{Good}(t) \circ \Lambda^{-1}$ defined in
(1.24) above to conclude,

Theorem 1.7. Let $N \ge 1$, $0 < r < R$, $u_*^{(N)} \in P_{\le N}H^{-1/2}_0(\mathbb{T})$, $0 < |k_0| \le N$,
$z \in \mathbb{C}$, and $T > 0$. Let $S^{(N)}_{Good}(T) : P_{\le N}H^{-1/2}_0(\mathbb{T}) \to P_{\le N}H^{-1/2}_0(\mathbb{T})$ be any
symplectomorphism. Then

$S^{(N)}_{Good}(T)(B^N(u_*^{(N)}; R)) \not\subseteq C^N_{k_0}(z; r).$



To deduce Theorem 1.5 from Theorem 1.7, one would like to let $N \to \infty$ and
show that the flow $S^{(N)}_{Good}(T)$ converged to $S_{KdV}(T)$ in some weak sense. More
precisely, one would need,

Condition 1.8. Let $k_0 \in \mathbb{Z}^*$, $T > 0$, $A > 0$, $0 < \varepsilon \ll 1$. Then there exists an
$N_0 = N_0(k_0, T, \varepsilon, A) > |k_0|$ such that

$|k_0|^{-1/2} |\widehat{S_{KdV}(T)u_0}(k_0) - \widehat{S^{(N)}_{Good}(T)u_0}(k_0)| \ll \varepsilon$

for all $N \ge N_0$ and all $u_0 \in B^N(0, A)$.



Once we find a finite dimensional symplectic flow $S^{(N)}_{Good}(t)$ for which Condition
1.8 holds, it is an easy matter to conclude Theorem 1.5. Indeed, let $r, R, u_*, k_0, z, T$
be as in that Theorem, and choose $0 < \varepsilon < (R - r)/2$. The ball $B^\infty(u_*; R)$ is contained
in some ball $B^\infty(0; A)$ centered at the origin. We choose $N \ge N_0(k_0, T, \varepsilon, A)$
so large that $\|u_* - P_{\le N}u_*\|_{H^{-1/2}_0} \le \varepsilon$. From Theorem 1.7 we can find initial data
$u_0^{(N)} \in P_{\le N}H^{-1/2}_0(\mathbb{T})$ satisfying $\|u_0^{(N)} - P_{\le N}u_*\|_{H^{-1/2}_0} \le R - \varepsilon$, and hence by the
triangle inequality,

$\|u_0^{(N)} - u_*\|_{H^{-1/2}_0} \le R,$

and so that at time $T$ we have,

$|k_0|^{-1/2} |\widehat{S^{(N)}_{Good}(T)u_0^{(N)}}(k_0) - z| > r + \varepsilon.$

If we then apply Condition 1.8 and the triangle inequality we obtain Theorem 1.5
with $u_0 := u_0^{(N)}$:

$|k_0|^{-1/2} |z - \widehat{S_{KdV}(T)u_0^{(N)}}(k_0)| \ge |k_0|^{-1/2} \Big( |z - \widehat{S^{(N)}_{Good}(T)u_0^{(N)}}(k_0)| - |\widehat{S^{(N)}_{Good}(T)u_0^{(N)}}(k_0) - \widehat{S_{KdV}(T)u_0^{(N)}}(k_0)| \Big)$
$\qquad > r + \varepsilon - \varepsilon = r.$



It remains to define the flow $S^{(N)}_{Good}(t)$. One might first try to follow Bourgain's
treatment of several different Hamiltonian PDE, notably the cubic NLS flow
on $L^2(\mathbb{T})$ (see [3], [5]). Note that the Hamiltonian $H[u]$ (1.18) is well-defined on
$(P_{\le N}H^{-1/2}_0(\mathbb{T}), \omega_{-1/2})$, and the equation giving the corresponding Hamiltonian flow
on this space can be computed as before to be (1.5), which can be viewed either as
a PDE or as a system of $2N$ ODE. The maps $S_{P_{\le N}KdV}(t)$ are therefore symplectomorphisms,
but from Theorem 1.1 we know that Condition 1.8 fails.

We proceed instead by using a flow of the form (1.7) as follows: Theorem 1.2
tells us that for any multiplier $B$ of the form described in (1.7), the finite dimensional
flow $S_{BKdV}$ provides a good approximation to the low frequency behavior of
KdV. However, the flows $S_{BKdV}$ are not symplectomorphisms, and hence cannot
be candidates for our flow $S^{(N)}_{Good}(t)$ in the discussion above. Fortunately, there is a
quick cure for this hiccup using the approximation given by Theorem 1.3 as follows:
we will define a symplectic, finite dimensional flow $S^{(N)}_{KdV}(t)$ on $P_{\le N}H^{-1/2}_0$ so that
the following diagram commutes.

(1.27)
$\begin{array}{ccc}
u_0 \in P_{\le N}H^{-1/2}_0 & \xrightarrow{\ B\ } & Bu_0 \\
\downarrow S^{(N)}_{KdV}(t) & & \downarrow S_{B^2KdV}(t) \\
S^{(N)}_{KdV}(t)u_0 & \xrightarrow{\ B\ } & S_{B^2KdV}(t)Bu_0
\end{array}$

We write explicitly the PDE defining this flow in (7.1) below. To show that $S^{(N)}_{KdV}(t)$
well approximates $S_{KdV}(t)$ at frequency $k_0$, and hence qualifies as our choice of
$S^{(N)}_{Good}(t)$, we will simply spell out the following: Theorem 1.3 allows us to replace
$S_{B^2KdV}(t)$ on the right side of (1.27) with $S_{KdV}(t)$; and our choice $N \gg |k_0|$ allows
us to ignore both the mappings on the top of (1.27) (again, by Theorem 1.3) and
the bottom of (1.27) (by the definition of $B$, this is the identity at frequency $k_0$).
We give the details in Section 7 below.

Acknowledgements. This work was conducted at UCLA. The authors are in- 
debted to Tom Mrowka for his detailed explanation of symplectic non-squeezing. 



2. Inverting the Miura transform 



As described in the introduction above, our work here on the KdV equation
relies on the continuity and invertibility properties of the Miura transform $u = Mv$,
where $M$ is defined by (see [24]),

$Mv := v_x + v^2 - P_0(v^2).$

The additional $P_0(v^2)$ term here is necessary to make the mean of $Mv$ vanish. Let
$S_{mKdV}(t)$ denote the flow associated to the mKdV equation (1.9). Then we have
the intertwining relationship

(2.1)    $M S_{mKdV}(t) = S_{KdV}(t) M.$

To see this, we suppose that $v$ solves the mKdV equation (1.9), and set $u := Mv$.
Then one easily checks,

$u_t + u_{xxx} - 6uu_x = (\partial_x + 2v)v_t + (\partial_x + 2v)v_{xxx} + 6v_x v_{xx}$
$\qquad\qquad - 6(v_x + v^2 - P_0(v^2))(v_{xx} + 2vv_x)$
$\qquad = (\partial_x + 2v)(v_t + v_{xxx} - 6v^2 v_x + 6P_0(v^2)v_x)$
$\qquad = 0.$
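
For readers who want to reproduce this check, the intermediate expansions can be sketched as follows. The abbreviation $c$ is introduced here for convenience; treating it as independent of $t$ is an assumption of this sketch (it holds when $\int v^2$ is conserved along the mKdV flow, which is not verified here).

```latex
% Abbreviation (not in the original): c := P_0(v^2), constant in x, assumed constant in t.
% With u = v_x + v^2 - c:
\begin{align*}
u_t     &= v_{xt} + 2 v v_t = (\partial_x + 2v)\, v_t, \\
u_{xxx} &= v_{xxxx} + (v^2)_{xxx}
         = v_{xxxx} + 2 v v_{xxx} + 6 v_x v_{xx}
         = (\partial_x + 2v)\, v_{xxx} + 6 v_x v_{xx}, \\
u_x     &= v_{xx} + 2 v v_x, \\
6 v_x v_{xx} - 6 u u_x
        &= -6 v^2 v_{xx} - 12 v v_x^2 - 12 v^3 v_x + 6 c\, v_{xx} + 12 c\, v v_x \\
        &= (\partial_x + 2v)\big( -6 v^2 v_x + 6 c\, v_x \big).
\end{align*}
% Adding the first, second and last identities gives
%   u_t + u_{xxx} - 6 u u_x = (\partial_x + 2v)(v_t + v_{xxx} - 6 v^2 v_x + 6 c\, v_x),
% which vanishes when v solves (1.9)-(1.10).
```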

Heuristically, the Miura transform acts like a derivative operator $\partial_x$, and in
particular we expect it to be a locally bilipschitz bijection from $H^s_0$ to $H^{s-1}_0$. The
purpose of this section is to make this heuristic rigorous for the range $s \ge 1/2$. (See
also [17], which studies the Miura transform for the larger range $s \ge 0$.)

In what follows we shall make frequent use of the well-known Sobolev multiplication
law

(2.2)    $\|uv\|_{H^s(\mathbb{T})} \lesssim \|u\|_{H^{s_1}(\mathbb{T})} \|v\|_{H^{s_2}(\mathbb{T})}$

whenever $s \le \min(s_1, s_2)$ and $s \le s_1 + s_2 - \frac{1}{2}$, with at least one of the two inequalities
being strict.

From (2.2) it is clear that $M$ is a locally Lipschitz $^{13}$ map from $H^s_0(\mathbb{T})$ to $H^{s-1}_0(\mathbb{T})$
for $s \ge 1/2$ (in fact $s > 0$ would suffice). The main result of this section is to invert
this statement:

Theorem 2.1. Let $s \ge 1/2$. Then the map $M$ is a bijection from $H^s_0(\mathbb{T})$ to
$H^{s-1}_0(\mathbb{T})$, and the inverse map $M^{-1}$ is a locally Lipschitz map from $H^{s-1}_0(\mathbb{T})$ to
$H^s_0(\mathbb{T})$.

Proof. We shall focus on the endpoint case s = 1/2. We shall see at the end of the 
proof that the higher regularity cases s > 1/2 then follow from the endpoint case 
and standard elliptic regularity theory. We remark that the arguments here (based 
on a variational approach) are unrelated to the rest of the paper and can be read 
independently. 

$^{13}$ By this we mean that $M$ is Lipschitz on every ball in $H^s_0(\mathbb{T})$, with a Lipschitz constant
depending on the ball.

Since the linearization $v \mapsto v_x$ of the Miura transform $M$ is clearly bilipschitz
from $H^{1/2}_0(\mathbb{T})$ to $H^{-1/2}_0(\mathbb{T})$ it is tempting to treat the lower order terms $v^2 - P_0(v^2)$
as perturbations to be iterated away. This works well if $v$ and $M(v)$ are
small, however for large $v$ it appears that iterative techniques alone cannot obtain
this result $^{14}$. Indeed, we shall need to also rely on variational techniques, and in
particular we will use the well-known connection between the Miura transform and
the spectral theory of Schrödinger operators. The key identity here is

(2.3)    $\big(\frac{d}{dx} + v\big)\big(-\frac{d}{dx} + v\big) = -\frac{d^2}{dx^2} + (v_x + v^2) = -\frac{d^2}{dx^2} + M(v) + P_0(v^2).$



We shall work entirely with the smooth functions in $H^{1/2}_0(\mathbb{T})$ and $H^{-1/2}_0(\mathbb{T})$, and obtain bilipschitz bounds for $\mathbf{M}$ on these functions; it will then be clear from standard limiting arguments that one has bilipschitz bounds in general.

Let $u \in H^{-1/2}_0(\mathbb{T})$ be smooth. We consider the problem of finding a smooth function $v \in H^{1/2}_0(\mathbb{T})$ with $\mathbf{M}v = u$, showing this $v$ is unique, and estimating $v$ in terms of $u$. This will be achieved by studying the self-adjoint Schrödinger operator $L = L_u$ defined by

$$L := -\frac{d^2}{dx^2} + u(x)$$

and the associated energy functional $E[\phi] = E_u[\phi]$ defined on $H^1(\mathbb{T})$ by

$$E[\phi] := \langle L\phi, \phi\rangle = \int_{\mathbb{T}} \phi_x(x)^2 + u(x)\phi(x)^2\, dx.$$

Since $L$ is a self-adjoint elliptic operator on a compact manifold $\mathbb{T}$, it has a discrete spectrum $\lambda_1 \le \lambda_2 \le \ldots$ with $\lambda_n \to +\infty$. In particular we have a lowest eigenvalue $\lambda_1 = \lambda_1(u) \in \mathbb{R}$, and a non-zero (real-valued) eigenfunction $\phi_1$ with $L\phi_1 = \lambda_1\phi_1$. A priori $\phi_1$ is only in $H^1(\mathbb{T})$, but since $u$ is smooth one can use the equation $L\phi_1 = \lambda_1\phi_1$ to deduce that $\phi_1$ is also smooth.

Our analysis here shall rely solely on $\lambda_1$. It is interesting to note that the work in [4], which is at a similar level of scaling to $H^{-1/2}(\mathbb{T})$, uses the entire spectrum $\lambda_n$ of the operator $L$.

From construction of $E[\phi]$ we observe that

(2.4) $E[\phi] \ge \lambda_1 \int_{\mathbb{T}} \phi^2$

for all $\phi \in H^1(\mathbb{T})$, with equality attained if and only if $\phi$ is a $\lambda_1$ eigenfunction of $L$. (As we shall see, $\lambda_1$ is an isolated eigenvalue, so equality only occurs when $\phi = c\phi_1$ for some $c$.) Thus $\lambda_1$ can be described in a variational manner.

Since $u \in H^{-1/2}_0(\mathbb{T})$ we see that $E[1] = 0$, thus $\lambda_1$ must be non-positive. If $u \not\equiv 0$ then $1$ is not an eigenfunction, and so $\lambda_1$ becomes strictly negative.

We now claim that $\phi_1$ cannot vanish anywhere. If it had a double zero at some point, i.e. $\phi_1(x_0) = \partial_x\phi_1(x_0) = 0$, then from the second-order ODE $L\phi_1 = \lambda_1\phi_1$ and the Picard existence theorem for ODE we see that $\phi_1 \equiv 0$, a contradiction. Now suppose that $\phi_1$ had a simple zero at $x_0$, so in particular $\phi_1$ changed sign. Let $\phi_1 = \phi_1^+ + \phi_1^-$ denote the positive and negative components of $\phi_1$. An integration by parts shows that

$$E[\phi_1^+] = \int_{\phi_1 > 0} L\phi_1(x)\,\phi_1(x)\,dx = \lambda_1 \int \phi_1^+(x)^2.$$

This implies that $\phi_1^+$ is a $\lambda_1$ eigenfunction of $L$, which contradicts the fact that all such eigenfunctions are smooth$^{15}$. Thus $\phi_1$ is nowhere vanishing; without loss of generality we may take $\phi_1$ to be positive and $L^2$-normalized (which uniquely identifies $\phi_1$). If we now define $v$ to be the logarithmic derivative of $\phi_1$,

$$v := \frac{\partial_x\phi_1(x)}{\phi_1(x)},$$

then $v$ is smooth and we have

$$v_x = \frac{\partial_{xx}\phi_1}{\phi_1} - \Big(\frac{\partial_x\phi_1}{\phi_1}\Big)^2 = u - \lambda_1 - v^2$$

(since $L\phi_1 = \lambda_1\phi_1$) and hence

$$u = v_x + v^2 + \lambda_1.$$

Taking means of both sides we see that

(2.5) $-\lambda_1 = P_0(v^2)$

and hence $u = \mathbf{M}v$.

$^{14}$ However, iterative techniques do allow us to bootstrap low regularity estimates to high regularity estimates, basically because $\mathbf{M}$ is elliptic and $v$ lies above the critical regularity $H^{-1/2}$ for $\mathbf{M}$ (and for mKdV). The strategy of this argument will be to use variational estimates to obtain a preliminary estimate in very rough norms, and use iteration to improve this to estimates in the correct norms $H^{1/2}_0(\mathbb{T})$ and $H^{-1/2}_0(\mathbb{T})$.
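As a sanity check on the construction of $v$ from the ground state, the following Python sketch (ours, not from the paper) takes the illustrative profile $v(x) = A\cos x$, for which $\partial_x^{-1}v = A\sin x$ and $P_0(v^2) = A^2/2$. It confirms numerically that $\phi = \exp(A\sin x)$ satisfies $L\phi = \lambda_1\phi$ when $u$ is the Miura transform of $v$ and $\lambda_1 = -P_0(v^2)$. The amplitude `A`, the grid size and the finite-difference step are arbitrary choices made for this example.

```python
import math

A = 0.7                    # amplitude of the illustrative profile v(x) = A cos x
MEAN_V2 = A * A / 2.0      # P_0(v^2) for this particular v


def v(x):
    return A * math.cos(x)


def v_x(x):
    return -A * math.sin(x)


def miura(x):
    """u = v_x + v^2 - P_0(v^2), the Miura transform of v."""
    return v_x(x) + v(x) ** 2 - MEAN_V2


def phi(x):
    """Candidate ground state exp(d_x^{-1} v) = exp(A sin x)."""
    return math.exp(A * math.sin(x))


def residual(x, h=1e-4):
    """L phi - lambda_1 phi, with L = -d_xx + u and lambda_1 = -P_0(v^2)."""
    phi_xx = (phi(x + h) - 2.0 * phi(x) + phi(x - h)) / (h * h)
    return -phi_xx + miura(x) * phi(x) + MEAN_V2 * phi(x)


def max_residual(n=200):
    """Largest residual over n equally spaced points of the circle."""
    return max(abs(residual(2.0 * math.pi * j / n)) for j in range(n))
```

The residual is zero in exact arithmetic; what `max_residual()` reports is only the error of the centred second difference.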

This shows existence of $v$ such that $u = \mathbf{M}v$. Observe from (2.3) and an integration by parts that

(2.6) $E[\phi] = \int (\phi_x - v\phi)^2\,dx - P_0(v^2)\int \phi^2;$

from this and (2.5) we immediately see that (2.4) holds (which we already knew), and that equality occurs if and only if $\phi_x = v\phi$, or in other words if $\phi$ is a constant multiple of $\exp(\partial_x^{-1}v)$. In particular this shows that $v$ is unique, for if we had $\mathbf{M}v = \mathbf{M}\tilde v$ then the above argument yields that $\exp(\partial_x^{-1}v)$ is a constant multiple of $\exp(\partial_x^{-1}\tilde v)$, which implies $v = \tilde v$ if $v, \tilde v$ both lie in $H^{1/2}_0(\mathbb{T})$.

We have now shown that $\mathbf{M}$ is smooth, locally Lipschitz, and bijective on smooth functions with mean zero. To extend this to $H^{1/2}_0(\mathbb{T})$ and $H^{-1/2}_0(\mathbb{T})$ we need some a priori estimates on $\mathbf{M}^{-1}$ in these norms.



Let $u \in H^{-1/2}_0(\mathbb{T})$ and $v \in H^{1/2}_0(\mathbb{T})$ be smooth functions such that $u = \mathbf{M}v$. For this discussion we will allow implicit constants to depend on the $H^{-1/2}_0(\mathbb{T})$ norm of $u$. Write $U := \partial_x^{-1}u$, thus $\|U\|_{H^{1/2}} \lesssim 1$. We observe from integration by parts, Hölder, Sobolev, and Gagliardo-Nirenberg that

$$\begin{aligned}
E[\phi] &= \int \phi_x^2 + \int u\phi^2 \\
&= \|\phi\|_{\dot H^1}^2 - 2\int U\phi\phi_x \\
&\ge \|\phi\|_{\dot H^1}^2 - C\|U\|_{L^4}\|\phi\|_{L^4}\|\phi\|_{\dot H^1} \\
&\ge \|\phi\|_{\dot H^1}^2 - C\|U\|_{H^{1/2}}\|\phi\|_{H^1}^{5/4}\|\phi\|_{L^2}^{3/4}.
\end{aligned}$$

$^{15}$ Alternatively, one can smooth $\phi_1^+$ at the zeroes of $\phi_1$ to contradict (2.4).

In particular we have the coercivity bound

$$E[\phi] + C\|\phi\|_{L^2}^2 \gtrsim \|\phi\|_{H^1}^2$$

for all $\phi \in H^1(\mathbb{T})$. Applying this to $\phi = \phi_1$ in particular and recalling the upper bound on $\lambda_1$ we obtain the eigenvalue bound

(2.7) $-C \le \lambda_1 \le 0$

and the preliminary eigenfunction bound

$$\|\phi_1\|_{H^1} \lesssim 1.$$

From (2.2) and the $H^{-1/2}_0(\mathbb{T})$ bound on $u$ we thus have

$$\|u\phi_1\|_{H^{-1/2}} \lesssim 1$$

which by the eigenfunction equation $L\phi_1 = \lambda_1\phi_1$ implies the better eigenfunction bound

(2.8) $\|\phi_1\|_{H^{3/2}} \lesssim 1.$

Now we estimate $v$. From (2.5) and (2.7) we have the preliminary bound

$$\|v\|_{L^2} \lesssim 1;$$

since $u = \mathbf{M}v$, we thus have

$$\|v_x - u\|_{L^1} \lesssim 1.$$

Since $L^1$ and $H^{-1/2}_0(\mathbb{T})$ both embed into $H^{-3/4}$ (for instance) we thus have by Sobolev that

$$\|v\|_{L^4} \lesssim \|v\|_{H^{1/4}} \lesssim \|v_x\|_{H^{-3/4}} \lesssim 1.$$

Returning once again to the equation $u = \mathbf{M}v$, we thus have

$$\|v_x - u\|_{L^2} \lesssim 1$$

which then implies

(2.9) $\|v\|_{H^{1/2}(\mathbb{T})} \lesssim 1.$

In particular we have

$$\|\partial_x^{-1}v\|_{L^\infty(\mathbb{T})} \lesssim \|\partial_x^{-1}v\|_{H^{3/2}(\mathbb{T})} \lesssim \|v\|_{H^{1/2}(\mathbb{T})} \lesssim 1,$$

and thus $\exp(\partial_x^{-1}v)$ is bounded above and below. Since $\phi_1$ is a constant multiple of $\exp(\partial_x^{-1}v)$, we thus see from (2.8) that

(2.10) $|\phi_1(x)| \sim 1$ for all $x \in \mathbb{T}$.

We have obtained good bounds for $v = \mathbf{M}^{-1}(u)$ and for the ground state $\phi_1$.

We now establish that $\mathbf{M}^{-1}$ is Lipschitz for smooth $v$ in a given bounded subset of $H^{1/2}_0$. From the inverse function theorem and the fact (from (2.2)) that $\mathbf{M}$ is a locally uniformly $C^2$ map from $H^{1/2}_0$ to $H^{-1/2}_0$, it suffices to show that the derivative map $\mathbf{M}'(v) : H^{1/2}_0 \to H^{-1/2}_0$ is uniformly invertible for $v$ in this set.

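A direct computation shows

$$\mathbf{M}'(v)(w) = (1 - P_0)(\partial_x + 2v)w.$$

We shall invert this explicitly.

Lemma 2.2. We have

$$\mathbf{M}'(v)^{-1} = A[\exp(-2\partial_x^{-1}v)]\,\partial_x^{-1}\,A[\exp(2\partial_x^{-1}v)]$$

where for any positive function $V \in H^{3/2}(\mathbb{T})$, $A[V] : H^{\pm 1/2}_0(\mathbb{T}) \to H^{\pm 1/2}_0(\mathbb{T})$ is the operator

$$A[V](w) := Vw - \frac{V}{P_0(V)}P_0(Vw).$$

We recommend that the reader think of $\mathbf{M}'(v)$ and $\mathbf{M}'(v)^{-1}$ as perturbations of $\partial_x$ and $\partial_x^{-1}$ respectively.

Proof. We have

$$\begin{aligned}
\mathbf{M}'(v) &= (1 - P_0)\exp(-2\partial_x^{-1}v)\,\partial_x\exp(2\partial_x^{-1}v) \\
&= (1 - P_0)\exp(-2\partial_x^{-1}v)\,\partial_x(1 - P_0)\exp(2\partial_x^{-1}v).
\end{aligned}$$

Also, observe that $A[V]$ is the inverse of $(1 - P_0)V^{-1}$ on $H^{\pm 1/2}_0(\mathbb{T})$. The claim follows. □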
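The inverse relation used at the end of this proof can be tested on a grid. In the sketch below (ours, for illustration only) the mean $P_0$ is replaced by a discrete average over `N_GRID` equally spaced points, and `V`, `w` are arbitrary sample functions. Assuming the operator is $A[V](w) = Vw - \frac{V}{P_0(V)}P_0(Vw)$, the code checks that applying $(1 - P_0)V^{-1}$ after $A[V]$ returns a mean-zero $w$, and that $A[V]w$ itself has mean zero.

```python
import math

N_GRID = 64  # number of equally spaced sample points on the circle


def mean(f):
    """Discrete stand-in for P_0: the average over the grid."""
    return sum(f) / len(f)


def apply_A(V, w):
    """A[V](w) = V*w - (V / P_0(V)) * P_0(V*w)."""
    Vw = [a * b for a, b in zip(V, w)]
    ratio = mean(Vw) / mean(V)
    return [vw - ratio * a for vw, a in zip(Vw, V)]


def apply_B(V, f):
    """The operator (1 - P_0) V^{-1} applied to f."""
    g = [b / a for a, b in zip(V, f)]
    m = mean(g)
    return [x - m for x in g]


xs = [2.0 * math.pi * j / N_GRID for j in range(N_GRID)]
V = [math.exp(0.8 * math.sin(x)) for x in xs]            # positive weight (illustrative)
w = [math.cos(x) + 0.3 * math.sin(3.0 * x) for x in xs]  # mean zero on this grid

Aw = apply_A(V, w)
round_trip = apply_B(V, Aw)
max_error = max(abs(a - b) for a, b in zip(round_trip, w))
mean_of_Aw = mean(Aw)
```

Both `max_error` and `mean_of_Aw` vanish up to rounding, because the subtracted multiple of $V$ becomes a constant after division by $V$ and is then removed by $1 - P_0$.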



Since $H^{3/2}$ is a Banach algebra (by (2.2)), we have

(2.11) $\|\exp(\pm 2\partial_x^{-1}v)\|_{H^{3/2}} \le \exp(C\|\partial_x^{-1}v\|_{H^{3/2}}) \le \exp(C\|v\|_{H^{1/2}_0}) \lesssim 1.$

Thus from Lemma 2.2 we see that $\mathbf{M}'(v)^{-1}$ is uniformly bounded from $H^{-1/2}_0$ to $H^{1/2}_0$.

We have now proven Theorem 2.1 at the endpoint $s = 1/2$. We now sketch how one can use elliptic regularity theory to bootstrap this to higher regularities $s > 1/2$.

Let us first show the boundedness of $\mathbf{M}^{-1}$ from $H^{s-1}_0$ to $H^s_0$ for smooth functions. In other words, if $u = \mathbf{M}v$ is smooth, we wish to show that

$$\|v\|_{H^s_0} \le C(\|u\|_{H^{s-1}_0}).$$

From the $H^{1/2}$ theory we already know that

$$\|v\|_{H^{1/2}_0} \le C(\|u\|_{H^{s-1}_0}).$$

Suppose for the moment that $1/2 < s < 3/2$. We write

$$\begin{aligned}
\|v\|_{H^s_0} &\lesssim \|v_x\|_{H^{s-1}_0} \\
&\lesssim \|\mathbf{M}v\|_{H^{s-1}_0} + \|(1 - P_0)v^2\|_{H^{s-1}_0} \\
&\lesssim \|u\|_{H^{s-1}_0} + \|v^2\|_{H^{s-1}_0}.
\end{aligned}$$

If $s < 3/2$, then by (2.2) we see that $\|v^2\|_{H^{s-1}_0} \lesssim \|v\|^2_{H^{1/2}_0} \le C(\|u\|_{H^{s-1}_0})$, which establishes boundedness. By iterating this type of argument again one can cover the case $3/2 \le s < 5/2$, and so forth until we obtain boundedness for all $s \ge 1/2$. The local Lipschitz property for $\mathbf{M}^{-1}$ is proven similarly and is left to the reader. □

From the above Theorem, the analyticity of $\mathbf{M}$, and the inverse function theorem we see in fact that $\mathbf{M}^{-1}$ is locally uniformly $C^m$ as a map from $H^{s-1}_0(\mathbb{T})$ to $H^s_0(\mathbb{T})$, for any integer $m$ and any $s \ge 1/2$.

3. The Fourier restriction spaces $Y^s$ and $Z^s$

In view of the results of the last section, we see that to analyze the KdV flow in the $H^{s-1}_0$ topology it will suffice to analyze the mKdV flow in the $H^s_0$ topology. We now review the basic machinery (from [1], [18], [8], [9]) for doing so.

If $u(x,t)$ is a function on the cylinder $\mathbb{T} \times \mathbb{R}$ with mean zero at every time, and $s, b \in \mathbb{R}$, we define the $X^{s,b} = X^{s,b}(\mathbb{T} \times \mathbb{R})$ norm by

$$\|u\|_{X^{s,b}} := \|\hat u(k,\tau)\langle k\rangle^s\langle\tau - k^3\rangle^b\|_{L^2_{\tau,k}}$$

where $L^2_{\tau,k}$ is with respect to Lebesgue measure $d\tau$ in the $\tau$ variable and counting measure in the $k$ variable, $\langle x\rangle^2 = (1 + |x|^2)$, and the space-time Fourier transform $\hat u(k,\tau)$ is given for $k \in \mathbb{Z}^*$, $\tau \in \mathbb{R}$ by

$$\hat u(k,\tau) := \int_{\mathbb{T}\times\mathbb{R}} e^{-2\pi i(xk + t\tau)}u(x,t)\,dx\,dt.$$

We use the same notation here as for the purely spatial Fourier transform (1.2), relying on context to distinguish the two.

We also need the spaces

(3.1) $\|u\|_{Y^s} := \|u\|_{X^{s,1/2}} + \|\langle k\rangle^s\hat u\|_{L^2_kL^1_\tau}$

and

(3.2) $\|u\|_{Z^s} := \|u\|_{X^{s,-1/2}} + \Big\|\frac{\langle k\rangle^s\hat u}{\langle\tau - k^3\rangle}\Big\|_{L^2_kL^1_\tau}.$

Observe that we have the crude estimate

(3.3) $\|u\|_{Z^s} \lesssim \|u\|_{X^{s,0}} = \|u\|_{L^2_tH^s_x}$

which will be useful for controlling quartic or higher order error terms; often we will be localized in time and just estimate $L^2_tH^s_x$ by $L^\infty_tH^s_x$. Here and in the sequel, we always allow implicit constants to depend on the exponent $s$.

We can restrict the space $Y^s$ to a time interval $I \subset \mathbb{R}$ in the usual manner as

$$\|u\|_{Y^s_I} := \inf\{\|v\|_{Y^s} : v|_{\mathbb{T}\times I} = u\}.$$

Similarly we can restrict the $Z^s$ norm. In practice we shall work in a fixed time interval (usually $[-T, T]$) and implicitly restrict all of our norms to this interval.

Now we give some embeddings for the $Y^s$ and $Z^s$ spaces. Since the Fourier transform of an $L^1$ function is continuous and bounded, we have from (3.1) that

(3.4) $Y^s \subset C_tH^s_x \subset L^\infty_tH^s_x.$

We have the "energy estimate",

(3.5) $\|\eta(t)v\|_{Y^s} \lesssim \|v(t_0)\|_{H^s} + \|v_t + v_{xxx}\|_{Z^s}$

for any $t_0 \in \mathbb{R}$ and any bump function $\eta$ supported on $[t_0 - C, t_0 + C]$. ([1], see also Lemma 3.1, [9]; see Lemmas 3.1 - 3.3 in [20] for analogous estimates in the nonperiodic context.)

Recall too the main estimate from [9] (see Proposition 1 in that paper), namely,

(3.6) $\Big\|\Big((1 - P_0)\prod_{i=1}^{k}u_i\Big)\partial_x v\Big\|_{Z^s} \lesssim \Big(\prod_{i=1}^{k}\|u_i\|_{Y^s}\Big)\|v\|_{Y^s}$

for any $s \ge 1/2$ and any integer $k \ge 2$, where the implicit constant depends on $k$. (We shall only use (3.6) with $k = 2, 3, 4$.) This particular estimate is crucial (especially at the endpoint $s = 1/2$) in order to prove the local (and global) well-posedness of the modified KdV equation (1.9) in $H^s_0(\mathbb{T})$ for $s \ge 1/2$.

It would be very convenient if the $Z^s$ on the left-hand side of (3.6) could be replaced by $Z^{s+\sigma}$ for some $\sigma > 0$; this extra smoothing estimate would make it easy to ignore the high-frequency components of the evolution and concentrate on the low frequency evolution. Unfortunately it is easy to see (by modifying the examples in [18]) that such estimates fail, especially at $s = 1/2$. Fortunately, as we will see in the next section there are some other ways to improve the trilinear version of (3.6) which will be useful for our approximation results.

4. An improved trilinear estimate 

The estimate (3.6) with k = 2 allows us to estimate the cubic nonlinearity F(v) 
defined in (1.10). However for our analysis we shall need a refined version of this 
estimate. 

The first step is to decompose F into "resonant" and "non-resonant" compo- 
nents. In the following analysis we shall always assume that v has mean zero. 

We start with the Fourier inversion formula

$$v(x) = \sum_{k \in \mathbb{Z}^*}\hat v(k)\exp(ikx)$$

for $v \in H^s_0$, where $\mathbb{Z}^* := \mathbb{Z}\setminus\{0\}$ is the set of the non-zero integers. A direct computation gives that the Fourier transform of $F(v)$ is

(4.1) $\widehat{F(v)}(k) = 6\sum_{\substack{k_1,k_2,k_3 \in \mathbb{Z}^*:\ k_1+k_2+k_3=k;\\ k_1+k_2 \ne 0}}\hat v(k_1)\hat v(k_2)\,ik_3\,\hat v(k_3)$

for all $k \in \mathbb{Z}^*$. The constraint $k_1 + k_2 \ne 0$ arises since we have subtracted the mean $P_0(v^2)$ from $v^2$ in the definition of $F(v)$. Observe that $F(v)$ is a perfect derivative and so has mean zero and thus no Fourier component at 0.

Lemma 4.1. We have

$$F(v) = F_0(v,v,v) + F_{\ne 0}(v,v,v)$$

where the "resonant" trilinear operator $F_0$ is given by

(4.2) $\widehat{F_0(u,v,w)}(k) := -6ik\,\hat u(k)\hat v(k)\hat w(-k)$

for $k \in \mathbb{Z}^*$, and the "non-resonant" trilinear operator $F_{\ne 0}$ is defined by

(4.3) $\widehat{F_{\ne 0}(u,v,w)}(k) := 2i\sum_{\substack{k_1,k_2,k_3 \in \mathbb{Z}^*:\\ k_1+k_2+k_3=k;\\ (k_1+k_2)(k_1+k_3)(k_2+k_3)\ne 0}}(k_1+k_2+k_3)\,\hat u(k_1)\hat v(k_2)\hat w(k_3)$

for $k \in \mathbb{Z}^*$.

Proof. Consider the right-hand side of (4.1), and break the sum into pieces according to how many of the quantities $k_1 + k_3$, $k_2 + k_3$ are zero. There is a single term in the sum for which $k_2 + k_3 = k_1 + k_3 = 0$, and the summation in this case is $F_0(v,v,v)$. If just $k_2 + k_3$ is zero, then the total contribution of this case vanishes since the summand in this case is antisymmetric with respect to swapping $k_2$ and $k_3$. Similarly if just $k_1 + k_3$ is zero. The remaining portion of the summation can be seen to be $F_{\ne 0}(v,v,v)$ by a symmetrization in $k_1, k_2, k_3$. □
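Lemma 4.1 can be checked numerically on a trigonometric polynomial. The sketch below is an illustration under our own conventions: it stores the Fourier coefficients of a random real, mean-zero $v$ in a dictionary (the helper names are hypothetical), evaluates the right-hand side of (4.1) directly, and compares it with the sum of the resonant term (4.2) and the non-resonant term (4.3) taken at $u = v = w$.

```python
import itertools
import random


def random_real_coeffs(max_freq, seed=0):
    """Fourier coefficients of a real, mean-zero trigonometric polynomial."""
    rng = random.Random(seed)
    coeffs = {}
    for k in range(1, max_freq + 1):
        z = complex(rng.uniform(-1, 1), rng.uniform(-1, 1))
        coeffs[k] = z
        coeffs[-k] = z.conjugate()  # reality condition
    return coeffs


def cubic_direct(c, k):
    """Right-hand side of (4.1) at frequency k."""
    total = 0j
    for k1, k2 in itertools.product(c, repeat=2):
        k3 = k - k1 - k2
        if k3 in c and k1 + k2 != 0:
            total += c[k1] * c[k2] * 1j * k3 * c[k3]
    return 6 * total


def resonant(c, k):
    """(4.2) evaluated at u = v = w."""
    return -6j * k * c.get(k, 0) * c.get(k, 0) * c.get(-k, 0)


def non_resonant(c, k):
    """(4.3) evaluated at u = v = w."""
    total = 0j
    for k1, k2 in itertools.product(c, repeat=2):
        k3 = k - k1 - k2
        if k3 in c and (k1 + k2) * (k1 + k3) * (k2 + k3) != 0:
            total += (k1 + k2 + k3) * c[k1] * c[k2] * c[k3]
    return 2j * total


def max_discrepancy(max_freq=5, seed=0):
    """Largest gap between (4.1) and (4.2) + (4.3) over all output frequencies."""
    c = random_real_coeffs(max_freq, seed)
    freqs = [k for k in range(-3 * max_freq, 3 * max_freq + 1) if k != 0]
    return max(abs(cubic_direct(c, k) - resonant(c, k) - non_resonant(c, k))
               for k in freqs)
```

The two sides agree up to floating-point rounding, which reflects the three steps of the proof: the doubly degenerate term gives $F_0$, the singly degenerate sums cancel by antisymmetry, and the rest symmetrizes.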

If $k = k_1 + k_2 + k_3$, then we have the fundamental resonance identity

(4.4) $k^3 - (k_1^3 + k_2^3 + k_3^3) = 3(k_1 + k_2)(k_1 + k_3)(k_2 + k_3)$

(see e.g. [1]). This justifies the terminology that $F_0$ is "resonant" but $F_{\ne 0}$ is "non-resonant".

We remark that, if $u, v, w$ are real, then $F_0(u,v,w)$ and $F_{\ne 0}(u,v,w)$ are also real, despite the presence of the imaginary $i$ in the definitions of these quantities. This follows from identities such as $\hat u(-k) = \overline{\hat u(k)}$. We leave the details to the reader. We also remark that eventually these two functions will be estimated in absolute value, so the constants which appear (e.g. the minus signs out front) will play no role.

4.1. The $F_0$ (resonant) estimate. We now give an estimate for $F_0$. Morally at least, the bound we give follows from the trilinear version of (3.6), but we present an independent proof here for the sake of completeness.

Lemma 4.2. For any $s \ge 1/2$, and any $u, v, w \in Y^s$ with mean zero, we have

(4.5) $\|F_0(u,v,w)\|_{Z^s} \lesssim \|u\|_{Y^s}\|v\|_{Y^s}\|w\|_{Y^s}.$

Proof. We shall just prove the endpoint case $s = 1/2$, as the general case easily follows (e.g. by using the identity $\langle\partial_x\rangle^{s-1/2}F_0(u,v,w) = F_0(\langle\partial_x\rangle^{s-1/2}u,v,w)$).

Split $u = \sum_{k \in \mathbb{Z}^*}u_k$, where $u_k$ is a complex-valued function whose spatial Fourier transform is supported on a single frequency $k$. Observe that

$$F_0(u,v,w) = \sum_k F_0(u_k, v_{-k}, w_k).$$

Thus if we show that

$$\|F_0(u_k, v_{-k}, w_k)\|_{Z^{1/2}} \lesssim \|u_k\|_{X^{1/2,1/2}}\|v_{-k}\|_{X^{1/2,1/2}}\|w_k\|_{X^{1/2,1/2}},$$

then the claim (4.5) follows by summing in $k$ and using Cauchy-Schwarz in $u$ and $v$ (just estimating the $w_k$ term crudely by $w$).

Fix $k$, and define the function $G_{u_k}(t)$ by $u_k(x,t) = e^{ikx}e^{ik^3t}G_{u_k}(t)$, so that

$$\|u_k\|_{X^{s,b}} \sim \langle k\rangle^s\|\langle\tau\rangle^b\hat G_{u_k}(\tau)\|_{L^2(\mathbb{R})},$$

similarly for $G_{v_{-k}}$ and $G_{w_k}$. The claim then collapses (after some translation in frequency space) to the one-dimensional temporal estimates

$$\|G_{u_k}G_{v_{-k}}G_{w_k}\|_{H^{-1/2}} \lesssim \|G_{u_k}\|_{H^{1/2}}\|G_{v_{-k}}\|_{H^{1/2}}\|G_{w_k}\|_{H^{1/2}}$$

and

$$\|\langle\tau\rangle^{-1}\widehat{G_{u_k}G_{v_{-k}}G_{w_k}}(\tau)\|_{L^1_\tau} \lesssim \|G_{u_k}\|_{H^{1/2}}\|G_{v_{-k}}\|_{H^{1/2}}\|G_{w_k}\|_{H^{1/2}}.$$

But both left-hand sides can be estimated by $\|G_{u_k}G_{v_{-k}}G_{w_k}\|_{L^2}$, and the claim follows easily from Hölder and Sobolev. □

4.2. The $F_{\ne 0}$ (non-resonant) estimate. We now turn to the non-resonant portion $F_{\ne 0}$ of the non-linearity. In analogy with (3.6), (4.5) we have the estimate

(4.6) $\|F_{\ne 0}(u,v,w)\|_{Z^s} \lesssim \|u\|_{Y^s}\|v\|_{Y^s}\|w\|_{Y^s}$

for all $s \ge 1/2$ and $u, v, w \in Y^s$ with mean zero. This estimate can be proven by the techniques used to prove (3.6) in [9], but we shall obtain it as a consequence of a slightly stronger version, which we now state.

We first need some Littlewood-Paley notation. If $N$ is an integer power of two, we let $P_N$ denote the dyadic projection operator

$$\widehat{P_N u}(k) = \chi_{N \le |k| < 2N}\,\hat u(k).$$

If $N_0, N_1, N_2, N_3$ are four integer powers of two, we let soprano, alto, tenor, baritone be a permutation of the indices 0, 1, 2, 3 such that

$$N_{soprano} \ge N_{alto} \ge N_{tenor} \ge N_{baritone}.$$

Theorem 4.3. Let $N_0, N_1, N_2, N_3$ be integer powers of two. Then

(4.7) $\|P_{N_0}F_{\ne 0}(P_{N_1}u, P_{N_2}v, P_{N_3}w)\|_{Z^{1/2}} \lesssim \Big(\frac{N_0}{N_{soprano}}\Big)^{\sigma}N_{tenor}^{-\sigma}\|u\|_{Y^{1/2}}\|v\|_{Y^{1/2}}\|w\|_{Y^{1/2}}$

for some absolute constant$^{16}$ $\sigma > 0$.

This means that (4.7) is only sharp when the output frequency $N_0$ is essentially the highest frequency, and the two lowest frequencies $N_{tenor}$ and $N_{baritone}$ are $O(1)$. This means that very low Fourier modes can influence high modes, but medium and high modes do not. In addition, the high modes do not have much influence on the low modes$^{17}$. From (4.7) one can easily obtain (4.6) by summing$^{18}$ in the $N_j$.
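The soprano/alto/tenor/baritone bookkeeping is easy to mechanize. The helper below is a sketch of ours, not part of the paper: it finds the dyadic block of a frequency, orders four dyadic sizes into the four voices, and evaluates the gain factor, assuming it has the form $(N_0/N_{soprano})^{\sigma}N_{tenor}^{-\sigma}$ as in (4.7). The theorem only asserts that some $\sigma > 0$ exists, so any value of `sigma` passed in is a placeholder.

```python
VOICES = ("soprano", "alto", "tenor", "baritone")


def dyadic_block(k):
    """Power of two N with N <= |k| < 2N, i.e. the block of P_N containing k."""
    if k == 0:
        raise ValueError("frequency must be non-zero")
    return 1 << (abs(k).bit_length() - 1)


def assign_voices(sizes):
    """Map voice names to indices 0..3 so that N_soprano >= ... >= N_baritone."""
    if len(sizes) != 4:
        raise ValueError("expected the four sizes N_0, N_1, N_2, N_3")
    order = sorted(range(4), key=lambda j: -sizes[j])  # ties keep index order
    return dict(zip(VOICES, order))


def gain_factor(sizes, sigma):
    """(N_0 / N_soprano)^sigma * N_tenor^(-sigma) for a placeholder sigma."""
    voices = assign_voices(sizes)
    n_soprano = sizes[voices["soprano"]]
    n_tenor = sizes[voices["tenor"]]
    return (sizes[0] / n_soprano) ** sigma * n_tenor ** (-sigma)
```

With `sizes = [64, 64, 1, 1]` the factor equals one (the high-low-low case with high output, where no gain is available), while lowering $N_0$ or raising $N_{tenor}$ makes it strictly smaller.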

The estimates (4.5) and (4.7) give some intuition for why it's possible to find a finite dimensional approximation to the mKdV flow - and hence, using the Miura transform, for the KdV flow as well: the only nonlinear interactions for which we now have no sharpened estimates are the resonant interactions coming from $F_0$ (which doesn't mix frequencies) and the high-low-low interactions in $F_{\ne 0}$. Heuristically, then, we might start believing that if we truncate high frequencies, the evolution will not see much of a difference at low frequencies. In fact, it is possible to use these estimates to prove low frequency approximation theorems for mKdV analogous to Theorems 1.2, 1.3, but we do not write out these results explicitly in this work.

The rest of this section is devoted to the proof of Theorem 4.3. We remark that 
the computations in this section are not needed elsewhere in the paper, and the 
reader may wish to take (4.7) for granted on the first pass and move to the next 
section. 



Proof. We begin by reviewing some (non-trivial) estimates from [9]. 

The proof of (4.7) relies mainly on the trilinear estimate 

(4.8) ||«i«2«3lL; i( < II «i Wxo.i-rh IKH^i-^ W^-^i-^ 

proven in Section 7 of [9]. This estimate can be viewed as a trilinear variant of the $L^6_{x,t}$ Strichartz estimate in [1], and its proof requires a small amount of elementary number theory.

$^{16}$ The quantity $\sigma$ shall vary from line to line.

$^{17}$ That is, when the soprano and alto dyadic factors are high frequencies and $N_0$ is low, we have a small first factor on the right side of (4.7).

$^{18}$ More precisely, one first observes that the left-hand side of (4.7) vanishes unless $N_{soprano} \sim N_{alto}$. Then one decomposes $u, v, w$ into dyadic pieces and exploits orthogonality of the projections $P_N$ in the $Y^s$ and $Z^s$ spaces. We omit the details.

We will also use the following estimate, which follows relatively quickly from some bounds found in [9],

(4.9) ii W^^ ii^^hi^ii.ii^ih,. 

for all $s \ge 1/2$ and some $\delta > 0$. To establish (4.9), recall Theorem 3 from [9]

k k 

(4.10) iirNx-i-i ^niNi^. 

i=l i=l 

for $s \ge \frac12$. We need also equation (9.2) in [9], which also holds when $s \ge \frac12$,

(4-11) || (T-fc3)l-* " L " L ' ~ l^lll^-l.lll^H^-!,!. 

Combining these two and writing for the moment 

W(k,r) = X¥ o(fc)((l - P )u^iT- P )(vw))(k,r) 

(4.12) =XkM k ) ikiu{k l )v{k 2 )w{k z ), 

k\ + k 2 + k 3 = k 
h,k 2 + k 3 ^0 

we conclude by (4.10) 

n (k)'W(k,T) H 

II— pyrr II < lluxll^-i.i • IIHIjf-Li 

(4.13) ^IH|y|H|y|H|y- 

We quickly conclude (4.9) from (4.13): looking at the definition of the norms involved, one sees that without loss of generality we may assume $\hat u(k), \hat v(k), \hat w(k) \ge 0$. Next, by replacing the factor $\hat u(k_1)$ appearing in (4.13), (4.12) with $\chi_{k_1 > 0}(k_1)\hat u(k_1)$, $\chi_{k_1 < 0}(k_1)\hat u(k_1)$, one concludes (4.13) with the function $W$ now replaced by,

W H (k,T) = XkM k ) X] \ki\u(k 1 )v(k 2 )w(k 3 ). 

fci + k 2 + k 3 = k 
h,k 2 + k 3 ^0 

Repeating this argument while interchanging the roles of $k_1, k_2$, and then $k_1, k_3$ and summing gives (4.13) with $W$ replaced with,

W IH (k,T) = X kM k ) S (\h\ + \k 2 \ + \k 3 \)u(h)v(k 2 )w(k 3 ). 

ki ■ k 2 ■ k 3 =/= 
(fci + k 2 )(k 2 + k 3 )(k 1 + k 3 ) £ 

By the definition of $F_{\ne 0}$ (4.3) this yields (4.9).

We now begin the proof of (4.7). It will suffice to prove the estimate

(4.14) $\|P_{N_0}F_{\ne 0}(P_{N_1}u, P_{N_2}v, P_{N_3}w)\|_{X^{1/2,-1/2}} \lesssim \Big(\frac{N_0}{N_{soprano}}\Big)^{\sigma}N_{tenor}^{-\sigma}\|u\|_{X^{1/2,1/2}}\|v\|_{X^{1/2,1/2}}\|w\|_{X^{1/2,1/2}}.$

Indeed, this estimate already controls the $X^{1/2,-1/2}$ portion of the $Z^{1/2}$ norm. To control the $L^2_kL^1_\tau$ portion, we observe from Hölder that the left-hand side of (4.14) controls

$$\Big\|\frac{\langle k\rangle^{1/2}\,\widehat{P_{N_0}F_{\ne 0}(P_{N_1}u, P_{N_2}v, P_{N_3}w)}(k,\tau)}{\langle\tau - k^3\rangle^{1+\delta}}\Big\|_{L^2_kL^1_\tau}$$

and the claim follows by a suitable interpolation with (4.9) (decreasing $\sigma$ if necessary).

It remains to prove (4.14). By duality this is equivalent to

$$\Big|\int\!\!\int u_0\,\partial_x^{-1}F_{\ne 0}(u_1,u_2,u_3)\,dx\,dt\Big| \lesssim \Big(\frac{N_0}{N_{soprano}}\Big)^{\sigma}N_{tenor}^{-\sigma}\|u_0\|_{X^{-3/2,1/2}}\|u_1\|_{X^{1/2,1/2}}\|u_2\|_{X^{1/2,1/2}}\|u_3\|_{X^{1/2,1/2}}$$

where $u_i$ has Fourier support on the region $|k_i| \sim N_i$. We have inserted the $\partial_x^{-1}$ multiplier to cancel the $(k_1 + k_2 + k_3)$ factor in (4.3).

The right-hand side is comparable to

(4.15) $\Big(\frac{N_0}{N_{soprano}}\Big)^{\sigma}N_{tenor}^{-\sigma}\,N_0^{-3/2}N_1^{1/2}N_2^{1/2}N_3^{1/2}\prod_{j=0}^{3}\|u_j\|_{X^{0,1/2}}.$

Note that we may assume $N_{soprano} \sim N_{alto}$ since the left-hand side of (4.14) vanishes otherwise. Hence the right side of (4.15) is bounded below (throwing away the factor $(\frac{N_0}{N_{soprano}})^{\sigma - 2}$) by

$$N_{tenor}^{1/2-\sigma}N_{baritone}^{1/2}N_{soprano}^{-1}\prod_{j=0}^{3}\|u_j\|_{X^{0,1/2}}.$$

Taking spacetime Fourier transforms and taking advantage of the frequency localization, we thus reduce to showing

(4.16) $\Big|\sum_{\substack{k_0,k_1,k_2,k_3 \in \mathbb{Z}^*:\\ k_0+k_1+k_2+k_3=0;\\ (k_1+k_2)(k_2+k_3)(k_1+k_3)\ne 0}}\int\prod_{j=0}^{3}\hat u_j(k_j,\tau_j)\,d\tau\Big| \lesssim N_{tenor}^{1/2-\sigma}N_{baritone}^{1/2}N_{soprano}^{-1}\prod_{j=0}^{3}\|u_j\|_{X^{0,1/2}}$

where $d\tau$ is integration over the three-dimensional space $\{(\tau_0,\tau_1,\tau_2,\tau_3) \in \mathbb{R}^4 : \tau_0 + \tau_1 + \tau_2 + \tau_3 = 0\}$ with measure $d\tau := \delta(\tau_0 + \tau_1 + \tau_2 + \tau_3)\prod_{j=0}^{3}d\tau_j$. We remark that the above estimate is now symmetric with respect to permutations of $k_0, k_1, k_2, k_3$.

Without loss of generality we may assume that the $\hat u_j$ are all non-negative. The next step is to exploit the implicit $\langle\tau_j - k_j^3\rangle^{1/2}$ denominators. From the fundamental identity (4.4),

(4.17) $\sum_{j=0}^{3}(\tau_j - k_j^3) = -\sum_{j=0}^{3}k_j^3 = 3(k_1 + k_2)(k_2 + k_3)(k_1 + k_3)$

we see that

$$\sup_{j=0,1,2,3}\langle\tau_j - k_j^3\rangle \gtrsim |k_1 + k_2||k_2 + k_3||k_1 + k_3| = |k_{soprano} + k_{baritone}||k_{alto} + k_{baritone}||k_{tenor} + k_{baritone}|.$$

By symmetry we may assume that the supremum on the left-hand side is attained when $j = 0$.
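The algebra behind (4.17) can be confirmed by brute force. Assuming the identity reads $-\sum_{j=0}^{3}k_j^3 = 3(k_1+k_2)(k_2+k_3)(k_1+k_3)$ whenever $k_0+k_1+k_2+k_3 = 0$, the following sketch (function names are ours) checks it on a box of integer frequencies. It also checks that pairing any one index with each of the other three reproduces the same product up to sign, which is what allows the product to be rewritten in terms of $k_{baritone}$.

```python
import itertools


def cubic_sum(ks):
    """-(k_0^3 + k_1^3 + k_2^3 + k_3^3) for a quadruple of frequencies."""
    return -sum(k ** 3 for k in ks)


def pair_product(k1, k2, k3):
    """3 (k_1 + k_2)(k_2 + k_3)(k_1 + k_3)."""
    return 3 * (k1 + k2) * (k2 + k3) * (k1 + k3)


def product_from(ks, a):
    """Product of (k_a + k_b) over the three indices b != a."""
    result = 1
    for b in range(4):
        if b != a:
            result *= ks[a] + ks[b]
    return result


def check_box(bound=10):
    """Verify both facts for all |k_1|, |k_2|, |k_3| <= bound."""
    rng = range(-bound, bound + 1)
    for k1, k2, k3 in itertools.product(rng, repeat=3):
        ks = (-(k1 + k2 + k3), k1, k2, k3)  # enforce k_0 + k_1 + k_2 + k_3 = 0
        if cubic_sum(ks) != pair_product(k1, k2, k3):
            return False
        reference = abs((k1 + k2) * (k2 + k3) * (k1 + k3))
        if any(abs(product_from(ks, a)) != reference for a in range(4)):
            return False
    return True
```

Since everything is integer arithmetic the comparison is exact, with no tolerance involved.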

Lemma 4.4. We have

(4.18) $|k_{soprano} + k_{baritone}||k_{alto} + k_{baritone}||k_{tenor} + k_{baritone}|\,N_{baritone} \gtrsim N_{soprano}^2.$

Proof. We have four cases:

Case 1: $N_{baritone} \ll N_{tenor} \ll N_{alto}$. Then the left-hand side of (4.18) is comparable to $N_{soprano}^2N_{tenor}N_{baritone}$.

Case 2: $N_{baritone} \sim N_{tenor} \ll N_{alto}$. Then the left-hand side of (4.18) is at least $\gtrsim N_{soprano}^2N_{tenor}$.

Case 3: $N_{baritone} \ll N_{tenor} \sim N_{alto}$. Then the left-hand side of (4.18) is comparable with $N_{soprano}^3N_{baritone}$.

Case 4: $N_{baritone} \sim N_{tenor} \sim N_{alto}$. Then at least one of $k_1 + k_2$, $k_2 + k_3$, $k_1 + k_3$ must have magnitude $\sim N_{soprano}$ (since they sum to $-2k_0$). Since the other two factors have magnitude at least 1, the left-hand side of (4.18) is $\gtrsim N_{soprano}^2$. □
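The case analysis can be cross-checked by enumeration. The sketch below is ours and rests on one assumption, namely that (4.18) reads $|k_{soprano}+k_{baritone}||k_{alto}+k_{baritone}||k_{tenor}+k_{baritone}|\,N_{baritone} \gtrsim N_{soprano}^2$. It scans all admissible quadruples in a small box and returns the smallest value of the left-hand side divided by $N_{soprano}^2$. The product of the three pair sums does not depend on which index is called baritone, so the code uses $(k_1+k_2)(k_2+k_3)(k_1+k_3)$ directly.

```python
import itertools


def dyadic_block(k):
    """Power of two N with N <= |k| < 2N."""
    return 1 << (abs(k).bit_length() - 1)


def worst_ratio(bound=12):
    """Minimum of LHS(4.18) / N_soprano^2 over admissible frequency quadruples."""
    freqs = [k for k in range(-bound, bound + 1) if k != 0]
    worst = None
    for k1, k2, k3 in itertools.product(freqs, repeat=3):
        k0 = -(k1 + k2 + k3)
        pairs = (k1 + k2) * (k2 + k3) * (k1 + k3)
        if k0 == 0 or pairs == 0:
            continue  # excluded by the constraints in (4.16)
        sizes = sorted(dyadic_block(k) for k in (k0, k1, k2, k3))
        n_baritone, n_soprano = sizes[0], sizes[-1]
        ratio = abs(pairs) * n_baritone / n_soprano ** 2
        if worst is None or ratio < worst:
            worst = ratio
    return worst
```

A finite scan of this kind cannot prove the lemma, but a minimum that stays bounded away from zero as `bound` grows is consistent with the stated inequality.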

From this lemma, we have

$$\langle\tau_0 - k_0^3\rangle \gtrsim N_{soprano}^2N_{baritone}^{-1}.$$

Thus to prove (4.16) it will suffice to show that

$$\Big|\sum_{\substack{k_0,k_1,k_2,k_3 \in \mathbb{Z}^*:\\ k_0+k_1+k_2+k_3=0;\\ (k_1+k_2)(k_2+k_3)(k_1+k_3)\ne 0}}\int\langle\tau_0 - k_0^3\rangle^{1/2}\prod_{j=0}^{3}\hat u_j(k_j,\tau_j)\,d\tau\Big| \lesssim N_{tenor}^{1/2-\sigma}\prod_{j=0}^{3}\|u_j\|_{X^{0,1/2}}.$$

At least one of $k_1, k_2, k_3$ is $O(N_{tenor})$; by symmetry let's suppose it's $k_3$. Then we can bound $N_{tenor}^{1/2-\sigma}$ by $\langle k_3\rangle^{1/2-\sigma}$, and then by undoing the Fourier transform and doing some substitutions the estimate becomes

$$\Big|\int\!\!\int v_0v_1v_2v_3\,dx\,dt\Big| \lesssim \|v_0\|_{X^{0,0}}\|v_1\|_{X^{0,1/2}}\|v_2\|_{X^{0,1/2}}\|v_3\|_{X^{1/2-\sigma,1/2}}.$$

But this follows directly from (4.8) if $\sigma$ is small enough. This proves (4.7). □

5. Proof of Theorem 1.3: KdV low frequencies are stable under high frequency perturbations of data.

We now prove Theorem 1.3. Fix $s$, $T$, $u_0$, $\tilde u_0$.

We have no upper bound on the time $T$, and so in particular we cannot hope to control the flow $S_{KdV}(t)$ on the entire interval $[-T, T]$ by a single application of the local well-posedness theory. On the other hand, because of the uniform bounds (1.3) we see that we can divide $[-T, T]$ into a bounded number $C(s, T, \|u_0\|_{H^s_0}, \|\tilde u_0\|_{H^s_0})$ of time intervals such that the local well-posedness theory can be used on each interval. It will thus suffice to prove a local-in-time version of Theorem 1.3; more precisely, it will suffice to show

Proposition 5.1. Fix $s \ge -1/2$, and $N' \ge 1$. Let $u_0, \tilde u_0 \in H^s_0$ be such that $P_{\le N'}u_0 = P_{\le N'}\tilde u_0$. Then, if $T'$ is sufficiently small depending on $s$, $\|u_0\|_{H^s_0}$, and $\|\tilde u_0\|_{H^s_0}$, we have

$$\sup_{|t| \le T'}\|P_{\le N' - (N')^{1/2}}(S_{KdV}(t)u_0 - S_{KdV}(t)\tilde u_0)\|_{H^s_0} \le (N')^{-\sigma}C(s, \|u_0\|_{H^s_0}, \|\tilde u_0\|_{H^s_0})$$

for some $\sigma = \sigma(s) > 0$.

The exponent 1/2 in $(N')^{1/2}$ is not particularly important here; any exponent between 0 and 1 would suffice.

To see how this proposition implies the theorem, first recall that we may assume that $N$ is large, $N \ge C(s, T, \|u_0\|_{H^s_0}, \|\tilde u_0\|_{H^s_0})$, since the claim in Theorem 1.3 trivially follows from (1.3) otherwise. (This same remark also applies of course in Proposition 5.1, allowing us to assume $N' \ge C(s, \|u_0\|_{H^s_0}, \|\tilde u_0\|_{H^s_0})$ there too.) From (1.3) we may divide $[-T, T]$ into $C(s, T, \|u_0\|_{H^s_0}, \|\tilde u_0\|_{H^s_0})$ time intervals, such that on each interval (a time-translated version of) Proposition 5.1 holds. Consider for example the first such time interval $[0, T']$ on the positive real axis. We start with $N' := 2N$ and apply Proposition 5.1, to get,

$$\sup_{t \in [0,T']}\|P_{\le N' - (N')^{1/2}}(S_{KdV}(t)u_0 - S_{KdV}(t)\tilde u_0)\|_{H^s_0} \le (N')^{-\sigma}C(s, \|u_0\|_{H^s_0}, \|\tilde u_0\|_{H^s_0}).$$

Before moving on to the next subinterval, modify $S_{KdV}(T')\tilde u_0$ on frequencies $|k| \le N' - (N')^{1/2}$ to agree with $S_{KdV}(T')u_0$. By the local well-posedness theory and the triangle inequality, we can proceed as on the first subinterval, decrementing $N'$ by $(N')^{1/2}$ each time we apply Proposition 5.1, to obtain Theorem 1.3 if $N$ (and hence $N'$) is sufficiently large.
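The role of the hypothesis that $N$ is large can be made concrete. The following sketch (ours; the function name is hypothetical) counts how many times the cutoff can be lowered by its own square root, starting from $N' = 2N$, before it would drop below $N$. Each application of Proposition 5.1 consumes one such step, so the argument closes as long as this count exceeds the bounded number of time intervals.

```python
import math


def decrement_steps(n):
    """Number of updates N' -> N' - sqrt(N') from N' = 2n that keep N' >= n."""
    cutoff = 2.0 * n
    steps = 0
    while cutoff - math.sqrt(cutoff) >= n:
        cutoff -= math.sqrt(cutoff)
        steps += 1
    return steps
```

The count grows roughly like a constant times $\sqrt{N}$, which is why a number of intervals depending only on $s$, $T$ and the data norms can always be accommodated once $N$ is large enough.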

It remains to prove Proposition 5.1. Henceforth we allow our implicit constants to depend on $s$, $\|u_0\|_{H^s_0}$, and $\|\tilde u_0\|_{H^s_0}$.

Define,

$$v_0 := \mathbf{M}^{-1}u_0; \quad v(t) := S_{mKdV}(t)v_0;$$
$$\tilde v_0 := \mathbf{M}^{-1}\tilde u_0; \quad \tilde v(t) := S_{mKdV}(t)\tilde v_0;$$

from Theorem 2.1 we thus have

$$\|v_0\|_{H^{s+1}_0}, \|\tilde v_0\|_{H^{s+1}_0} \le C$$

while from (2.1) we have

$$S_{KdV}(t)u_0 = \mathbf{M}v(t); \quad S_{KdV}(t)\tilde u_0 = \mathbf{M}\tilde v(t).$$

Our task is thus to show that

(5.1) $\sup_{|t| \le T'}\|P_{\le N' - (N')^{1/2}}(\mathbf{M}v(t) - \mathbf{M}\tilde v(t))\|_{H^s_0} \le C(N')^{-\sigma}.$

Henceforth we allow the quantity $\sigma > 0$ to vary from line to line.

We first investigate the discrepancy between $v_0$ and $\tilde v_0$ at time 0.

Lemma 5.2. With $v_0, \tilde v_0$ defined as above, we have,

$$\|P_{\le N'}(v_0 - \tilde v_0)\|_{H^{s+1}_0} \le C(N')^{-\sigma}.$$

Proof. From the definitions and our assumptions on $u_0, \tilde u_0$ we have

$$P_{\le N'}(\mathbf{M}v_0 - \mathbf{M}\tilde v_0) = 0.$$

On the other hand, from Theorem 2.1 we have

$$\|P_{\le N'}(v_0 - \tilde v_0)\|_{H^{s+1}_0} \le C\|\mathbf{M}P_{\le N'}v_0 - \mathbf{M}P_{\le N'}\tilde v_0\|_{H^s_0}.$$

Thus by the triangle inequality it will suffice to show the commutator estimate,

(5.2) $\|\mathbf{M}P_{\le N'}v_0 - P_{\le N'}\mathbf{M}v_0\|_{H^s_0} \le C(N')^{-\sigma}$

and similarly for $\tilde v_0$.

Clearly it will suffice just to consider $v_0$. From the definition (1.8) of the transform $\mathbf{M}$ and the fact that $P_0$, $P_{\le N'}$ and $\partial_x$ all commute, we have

$$\begin{aligned}
\mathbf{M}P_{\le N'}v_0 - P_{\le N'}\mathbf{M}v_0 &= (1 - P_0)[(P_{\le N'}v_0)^2 - P_{\le N'}(v_0)^2] \\
&= (1 - P_{\le N'})[(P_{\le N'}v_0)^2] - (P_{\le N'} - P_0)[((1 - P_{\le N'})v_0)((1 + P_{\le N'})v_0)].
\end{aligned}$$

But the last two terms have an $H^s_0$ norm of $O((N')^{-\sigma})$ for some $\sigma > 0$; this can be seen by the Sobolev multiplication law (2.2), the $H^{s+1}$ bound on $v_0$, and the estimate

$$\|(1 - P_{\le N'})v\|_{H^s} \lesssim (N')^{-\sigma}\|v\|_{H^{s+\sigma}},$$

to extract the $(N')^{-\sigma}$ decay from the high frequency projection $1 - P_{\le N'}$. The claim follows. □

We still have to prove (5.1). It will suffice to show that

(5.3) $\sup_{|t| \le T'}\|P_{\le N' - (N')^{1/2}}(v(t) - \tilde v(t))\|_{H^{s+1}_0} \le C(N')^{-\sigma}.$

This is basically because the commutator of $\mathbf{M}$ with $P_{\le N' - (N')^{1/2}}$ is small thanks to the argument in the proof of Lemma 5.2. We omit the details as they are very similar to those in Lemma 5.2.






From Lemma 5.2 we see that $v_0$ and $\tilde v_0$ are almost identical at low frequencies $|k| \le N'$. In fact, because the solution map $S_{mKdV}(t)$ is locally Lipschitz$^{19}$ in $H^{s+1}_0$, we may assume that

(5.4) $P_{\le N'}(v_0 - \tilde v_0) = 0,$

since the general case then follows by modifying $v_0$ (or $\tilde v_0$) by a small amount in $H^{s+1}_0$ and using the Lipschitz property.

Henceforth we assume (5.4), so that the low frequency ($|k| \le N'$) portions of $v(t)$ and $\tilde v(t)$ are identical at time 0. Our task is to prove (5.3), which asserts that the slightly lower frequency ($|k| \le N' - (N')^{1/2}$) portions of $v(t)$ and $\tilde v(t)$ are still very close together at later times. This will be achieved primarily through the improved trilinear estimate (4.7).

In what follows we assume all our spacetime norms are restricted to the time interval $[-T', T']$.

From the local well-posedness theory of mKdV (see$^{20}$ (3.5), (4.5), (4.6), or [1], [20], [9]) we have the local estimates

(5.5) $\|v\|_{Y^{s+1}} + \|\tilde v\|_{Y^{s+1}} \le C$

if the time $T'$ is chosen sufficiently small depending on the $H^{s+1}_0$ norms of $v_0, \tilde v_0$.

The frequency interval $[N' - (N')^{1/2}, N']$ contains $O((N')^{1/4})$ intervals of the form $[M, M + (N')^{1/4}]$. By orthogonality and the pigeonhole principle, we see that there must exist one of these intervals $[M, M + (N')^{1/4}]$ such that

(5.6) $\|(P_{\le M + (N')^{1/4}} - P_{\le M})v\|_{Y^{s+1}} + \|(P_{\le M + (N')^{1/4}} - P_{\le M})\tilde v\|_{Y^{s+1}} \le C(N')^{-\sigma}.$

Fix this $M$. We split

$$v = v_{lo} + v_{med} + v_{hi}$$

where

$$v_{lo} := P_{\le M}v; \quad v_{med} := (P_{\le M + (N')^{1/4}} - P_{\le M})v; \quad v_{hi} := (1 - P_{\le M + (N')^{1/4}})v.$$

Thus from (5.5), (5.6) we have

(5.7) $\|v_{lo}\|_{Y^{s+1}} \le C; \quad \|v_{med}\|_{Y^{s+1}} \le C(N')^{-\sigma}.$
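The pigeonhole step used to choose $M$ can be sketched as follows. This is an illustration of ours rather than the paper's procedure: `energy` is a hypothetical dictionary giving a non-negative amount of "energy" per frequency (standing in for the squared norms that orthogonality lets one add up), and the windows are taken half-open, $[M, M + (N')^{1/4})$, so that they tile $[N' - (N')^{1/2}, N')$ exactly.

```python
import math


def quietest_window(energy, n_prime):
    """Return (M, window_energy) for the width-(N')^{1/4} window carrying least energy."""
    width = max(1, round(n_prime ** 0.25))
    start = n_prime - round(math.sqrt(n_prime))
    best = None
    for m in range(start, n_prime - width + 1, width):
        window_energy = sum(energy.get(k, 0.0) for k in range(m, m + width))
        if best is None or window_energy < best[1]:
            best = (m, window_energy)
    return best


# Hypothetical profile on [9900, 10000): one unit per frequency,
# except for a quiet stretch starting at 9950.
N_PRIME = 10_000
energy = {k: 1.0 for k in range(9_900, 10_000)}
for k in range(9_950, 9_960):
    energy[k] = 0.01

M, window_energy = quietest_window(energy, N_PRIME)
total_energy = sum(energy.values())
window_count = 10  # (N')^{1/2} / (N')^{1/4} windows when N' = 10_000
```

Whatever the profile, the selected window carries at most `total_energy / window_count`, which is the source of the extra $(N')^{-\sigma}$ decay for $v_{med}$.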

Applying $P_{\le M}$ to (1.9) and using Lemma 4.1, we see that $v_{lo}$ obeys the equation

$$(\partial_t + \partial_{xxx})v_{lo} = P_{\le M}F_0(v,v,v) + P_{\le M}F_{\ne 0}(v,v,v).$$

From the definition (4.2) of the resonant operator $F_0$, we see that

$$P_{\le M}F_0(v,v,v) = F_0(v_{lo},v_{lo},v_{lo}).$$

The situation for $F_{\ne 0}$ is more complicated as this nonlinearity will mix $v_{lo}$, $v_{med}$, $v_{hi}$ together. Define an error term to be any quantity with a $Z^{s+1}$ norm of $O((N')^{-\sigma})$. From (5.7) and (4.6) we see that any term in $F_{\ne 0}(v,v,v)$ involving $v_{med}$ is an error term.

$^{19}$ Since we are assuming $T'$ to be small this follows directly from the local well-posedness theory.

$^{20}$ Strictly speaking, when the data $v_0, \tilde v_0$ has large $H^{s+1}_0$ norm one has to first rescale the torus by a suitable scaling parameter $\lambda$ in order to close the iteration, but this has no significant effect on our argument. The details are carried out in [9], [8].

Now let us consider the terms which involve Vhi- A typical term is 

P<MF^o{vio,Vlo,V h i)- 

We can dyadically decompose this as 

Pn P<M F^oiPN^lo, Pn 2 v Io, PN 3 v hi)- 

N a .N 1 ,N 2 ,N 3 

Such a term can be estimated using the frequency separation between vi and Vhi- 
for the summand to be nonzero, wc need N U N 2 < M, and N 3 > M+(N')i. Using 
the notation in the definition of F^a (4.3), we also need |fci + + &3| ~ No < M, 
hence we must clearly also have N tenor > (iV') 1 / 4 . From our non-resonant estimate 

(4.7) , the bounds (5.7) above, and a summation of the dyadic indices Nj (conceding 
some powers of log N' if necessary) we thus see that this term is an error term. A 
similar argument shows that any other term involving vm will also be an error term. 
Thus we see that vi D obeys the equation 

(5.8) (d t + d xxx )vi = F (vi o ,vio,vi o ) + P<mF^ (vio,vio,vi o ) + error terms. 

By similar reasoning, the function $\tilde v_{lo} := P_{\leq M}\tilde v$ also obeys the same equation (but 
with slightly different error terms, of course). Since $\tilde v_{lo}(0) = v_{lo}(0)$, we thus see 
from the standard local well-posedness theory$^{21}$ that 

(5.9) $\|v_{lo} - \tilde v_{lo}\|_{Y^{s+1}} \leq C(N')^{-\sigma}$

which by (3.4) implies (5.3) as desired. This proves Theorem 1.3. 

6. Proof of Theorem 1.2: BKdV approximates KdV at low frequencies 

We now prove the more difficult of our KdV approximation theorems, namely 
Theorem 1.2. The proof here is definitely in the same spirit as that of Theorem 
1.3, in that we show two flows remain close by showing that their mKdV analogues 
remain close. However, the proof will be more complicated since one of the flows 
being studied is $S_{BKdV}$ (see (1.7)), and the standard Miura transform $M$ defined 
by (1.8) seems an inappropriate tool with which to pull the $S_{BKdV}$ flow back to 
an mKdV-type evolution, as it introduces a v^, type nonlinearity on the right side 
of (1.9) which is too rough for us to estimate. Instead, we introduce a modified 



A rough sketch of what we have in mind here is: write $G$ for that portion of the nonlinearity 
on the right side of (5.8) not involving the error terms, and note 

$v_{lo} - \tilde v_{lo} = \int_0^t e^{-(t - \tau)\partial_{xxx}}(G(v_{lo}) - G(\tilde v_{lo}) + \text{error terms})\, d\tau.$

Writing $G(v_{lo}) - G(\tilde v_{lo}) = \int_0^1 DG(\theta v_{lo} + (1 - \theta)\tilde v_{lo})(v_{lo} - \tilde v_{lo})\, d\theta$, we use (3.5), (4.5), (4.6), and 
the fact that by scaling, we may assume that the data for $v$, $\tilde v$ are small in $Y^{s+1}$ to conclude 
(5.9). 
Miura transform $M_B$. This strategy is illustrated in (6.1), where we have written 
$S_{BmKdV}$ for the flow which intertwines $M_B$ and BKdV in the sense that 

$M_B \circ S_{BmKdV}(t) \circ M_B^{-1} = S_{BKdV}(t).$



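(6.1) [Diagram: in the top row, $M$ maps $v_0$ to $u_0$ and $M_B$ maps $\tilde v_0$ to $u_0$; the flows 
$S_{mKdV}(t)$, $S_{KdV}(t)$, $S_{BKdV}(t)$, $S_{BmKdV}(t)$ carry $v_0$, $u_0$, $u_0$, $\tilde v_0$ to $v(t)$, $u(t)$, $\tilde u(t)$, $\tilde v(t)$ 
respectively; in the bottom row, $M$ maps $v(t)$ to $u(t)$ and $M_B$ maps $\tilde v(t)$ to $\tilde u(t)$.] 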



We can summarize the proof of Theorem 1.2 (using the same notation as in (6.1), 
which will be defined momentarily) by saying that $u(t)$, $\tilde u(t)$ are shown to be close 
at low frequencies by showing that $v(t)$, $\tilde v(t)$ are likewise close. 

We now turn to the details. Fix $s \geq -1/2$, $T > 0$, $N \gg 1$, $B$, and $u_0 \in H_0^s$; our 
implicit constants may depend on $s$, $T$, and $\|u_0\|_{H_0^s}$. We work exclusively in the 
time interval $[-T, T]$. 

Let $\tilde u(t) := S_{BKdV}(t)u_0$ denote the evolution of the flow (1.7). Our task is to 
show that 

(6.2) $\sup_{|t| \leq T} \|P_{\leq N^{1/2}}(S_{KdV}(t)u_0 - \tilde u(t))\|_{H_0^s} \lesssim N^{-\sigma}.$



We first claim (in analogy with (1.3)) the bound 
(6.3) $\sup_{|t| \leq T} \|\tilde u(t)\|_{H_0^s} \lesssim 1,$

if $N$ is large enough. This bound is achieved by a repetition of the arguments in [8]. 
As it is somewhat technical and uses techniques different from those elsewhere in 
this paper (notably the "$I$-method"), we defer the proof of (6.3) to an Appendix. 

We may assume from (6.3) and the local well-posedness theory$^{22}$ that $u_0$, and 
hence $\tilde u$, is smooth. 

The Miura transform (1.8) intertwines the KdV flow with the (renormalized) 
mKdV flow (1.9), (1.10). We seek a similar transform to intertwine the KdV-like 
flow $S_{BKdV}$ with an mKdV-like flow. It turns out that the correct transform to use 
is given by 

(6.4) $M_Bv := v_x + B(1 - P_0)(v^2) = v_x + B(v^2) - P_0(v^2),$

where of course the multiplier $B$ here is that which appears in the flow (1.7) above. 

22 The well-posedness theory for KdV from [18] can be applied without substantial change to 
the BKdV equation (1.7). The presence of the multiplier B on the right hand side presents no 
difficulty. 
As with $M$, the operator $M_B$ is a locally Lipschitz map from $H_0^{s+1}$ to $H_0^s$. We 
now address the question of invertibility of $M_B$. 

Let $v$ be a function bounded in $H_0^{s+1}$. We first look at the derivative operator 
$M'_B(v)$, defined by 

$M'_B(v)f := f_x + 2B(1 - P_0)(vf).$

Lemma 6.1. Fix $v \in H_0^{s+1}$, $s \geq -\frac12$, and allow the implicit constants in this 
Lemma to depend on $\|v\|_{H_0^{s+1}}$. If $N$ is sufficiently large, then $M'_B(v)$ is invertible 
from $H_0^s$ to $H_0^{s+1}$, in the sense that 

$\|M'_B(v)^{-1}f\|_{H_0^{s+1}} \lesssim \|f\|_{H_0^s}$

for all (smooth) $f$. 

Proof. Recall from the proof of Theorem 2.1 that we have the bound 
(6.5) $\|M'(v)^{-1}f\|_{H_0^{s+1}} \lesssim \|f\|_{H_0^s}.$

We proved this for $s = -1/2$ but it is easy to see the same argument works for 
$s > -1/2$. From the resolvent identity 

$O^{-1} = A^{-1}(1 - (A - O)A^{-1})^{-1}$

it thus suffices to show that the operator 

$(M'_B(v) - M'(v))M'(v)^{-1}$

is a contraction on $H_0^s$. Applying (6.5) again, it thus suffices to show the bound 

$\|M'_B(v)f - M'(v)f\|_{H_0^s} \ll \|f\|_{H_0^{s+1}}.$

But the left-hand side is just 

$\|2(1 - B)(vf)\|_{H_0^s} \lesssim N^{-\sigma}\|vf\|_{H_0^{s+\sigma}} \lesssim N^{-\sigma}\|v\|_{H_0^{s+1}}\|f\|_{H_0^{s+1}} \lesssim N^{-\sigma}\|f\|_{H_0^{s+1}}$

by (2.2) for some $\sigma > 0$, and the claim follows if $N$ is sufficiently large. □ 
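The resolvent identity used in this proof is purely algebraic, so it can be sanity-checked on small matrices. The following sketch is only an illustration with hypothetical $2 \times 2$ matrices (exact rational arithmetic, no external libraries): `A` plays the role of the operator that is already known to be invertible and `O` the perturbed operator.

```python
from fractions import Fraction as F


def matmul(X, Y):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def inverse(X):
    """Inverse of an invertible 2x2 matrix."""
    (a, b), (c, d) = X
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def sub(X, Y):
    return [[X[i][j] - Y[i][j] for j in range(2)] for i in range(2)]


I2 = [[F(1), F(0)], [F(0), F(1)]]
A = [[F(2), F(1)], [F(0), F(3)]]                  # hypothetical invertible operator
O = [[F(2), F(11, 10)], [F(1, 10), F(3)]]         # hypothetical small perturbation of A

lhs = inverse(O)
rhs = matmul(inverse(A), inverse(sub(I2, matmul(sub(A, O), inverse(A)))))
assert lhs == rhs
```

The identity holds because $1 - (A - O)A^{-1} = OA^{-1}$; the analytic content of the lemma is that $(A - O)A^{-1}$ is small, so the inverse on the right can be summed as a Neumann series.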

Corollary 6.2. Let $R > 0$, $s \geq -\frac12$. If $N$ is large enough depending on $R$, then 
there is a map $M_B^{-1}$ defined on the ball $B^{\infty}(0; R) := \{u \in H_0^s : \|u\|_{H_0^s} < R\}$ which 
inverts $M_B$ and is a Lipschitz map from $B^{\infty}(0; R)$ to $H_0^{s+1}$. 

Remark: Recall that $M_B$ depends on $N$ through the definition of $B$ (see (1.7)). 

Proof. Fix $R$; implicit constants are allowed to depend on $R$. 

Let $u \in B^{\infty}(0; R)$. To define $M_B^{-1}$ at $u$ we of course have to solve the equation 

$M_Bv = u.$

From Theorem 2.1 we can find a $v_{appr}$, bounded in $H_0^{s+1}$, such that 

$Mv_{appr} = u.$

We now apply the ansatz $v = v_{appr} + w$. One easily checks, using (6.4), that $w$ 
verifies the difference equation 

$w_x + B(1 - P_0)(2v_{appr}w + w^2) = (1 - B)(v_{appr}^2)$
or equivalently 

$w = M'_B(v_{appr})^{-1}(1 - B)(v_{appr}^2) - M'_B(v_{appr})^{-1}B(1 - P_0)(w^2).$

Since $v_{appr}$ is bounded in $H_0^{s+1}$ we see from Lemma 6.1 and (2.2) that 

$\|M'_B(v_{appr})^{-1}(1 - B)(v_{appr}^2)\|_{H_0^{s+1}} \lesssim N^{-\sigma}.$

A contraction mapping argument again using Lemma 6.1 and (2.2) thus shows that 
a solution $w$ to the above difference equation exists and obeys the bound 

$\|w\|_{H_0^{s+1}} \lesssim N^{-\sigma}$

if $N$ is sufficiently large. In particular we see that $M_B^{-1}$ exists at $u$ and that $M_B^{-1}$ 
is bounded on $H_0^s$. 

The Lipschitz bound now follows from Lemma 6.1 and the inverse function 
theorem, since $M_B$ is a smooth map from $H_0^{s+1}$ to $H_0^s$. (Equivalently, one can 
use contraction mapping arguments similar to the one above to show that $M_B^{-1}$ is 
uniformly Lipschitz on very small neighbourhoods of $u$, and hence on the whole 
ball $B^{\infty}(0; R)$). □ 
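The contraction mapping step in this proof has the shape "small forcing plus a quadratic term". A scalar toy version, with hypothetical numbers standing in for the operators, shows why the fixed point inherits the size of the forcing: in $w = a - cw^2$ the number `a` plays the role of the $O(N^{-\sigma})$ term and `c * w * w` the role of the quadratic term in $w$.

```python
def solve_by_iteration(a, c, steps=50):
    """Iterate w -> a - c*w*w starting from w = 0 (toy scalar model)."""
    w = 0.0
    for _ in range(steps):
        w = a - c * w * w
    return w


a, c = 1e-3, 2.0          # hypothetical sizes: small forcing, bounded quadratic constant
w = solve_by_iteration(a, c)
residual = abs(w - (a - c * w * w))
```

For small `a` the map is a contraction on a ball of radius comparable to `a`, so the iteration converges and the solution is itself of size comparable to `a`, mirroring the bound on $w$ in the proof.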

Thus if $N$ is large enough, the above corollary and (6.3) let us write 

(6.6) $\tilde v(t) = M_B^{-1}\tilde u(t)$
and conclude also that 

(6.7) $\sup_{|t| \leq T} \|\tilde v(t)\|_{H_0^{s+1}} \lesssim 1.$

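From the Leibniz rule we see that 
$\tilde u_t = M'_B(\tilde v)\tilde v_t$

$\tilde u_x = M'_B(\tilde v)\tilde v_x = \tilde v_{xx} + 2B(\tilde v\tilde v_x)$

$\tilde u_{xxx} = M'_B(\tilde v)\tilde v_{xxx} + 6B(\tilde v_x\tilde v_{xx})$

$\tilde u\tilde u_x = (\tilde v_x + B(\tilde v^2) - P_0(\tilde v^2))M'_B(\tilde v)\tilde v_x$

$= \tilde v_x\tilde v_{xx} + 2\tilde v_xB(\tilde v\tilde v_x) + B(\tilde v^2)\tilde v_{xx} + 2B(\tilde v^2)B(\tilde v\tilde v_x) - M'_B(\tilde v)(P_0(\tilde v^2)\tilde v_x)$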

where we have used the fact that $P_0(ff_x) = 0$ for any $f$. Expanding out (1.7) and 
canceling the two terms of $6B(\tilde v_x\tilde v_{xx})$ which appear, we obtain 

$M'_B(\tilde v)(\tilde v_t + \tilde v_{xxx}) = 6B(2\tilde v_xB(\tilde v\tilde v_x) + B(\tilde v^2)\tilde v_{xx} + 2B(\tilde v^2)B(\tilde v\tilde v_x)) - BM'_B(\tilde v)(6P_0(\tilde v^2)\tilde v_x).$
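As a check on the third-derivative identity among the Leibniz-rule formulas, here is the computation written out. It uses only the definition (6.4), the formula for $M'_B$, the fact that $B$ commutes with $\partial_x$, and that $P_0(v^2)$ is constant in $x$; the symbol $v$ stands for the function to which $M_B$ is applied and $u := M_Bv$.

```latex
\begin{align*}
u_{xxx} &= \partial_x^3\bigl(v_x + B(v^2) - P_0(v^2)\bigr)
         = v_{xxxx} + B\bigl((v^2)_{xxx}\bigr)
         = v_{xxxx} + B\bigl(6 v_x v_{xx} + 2 v v_{xxx}\bigr), \\
M_B'(v)\, v_{xxx} &= v_{xxxx} + 2B(1 - P_0)(v v_{xxx})
         = v_{xxxx} + 2B(v v_{xxx}), \\
u_{xxx} - M_B'(v)\, v_{xxx} &= 6B(v_x v_{xx}).
\end{align*}
```

The middle line uses $P_0(vv_{xxx}) = -P_0(v_xv_{xx}) = 0$, which follows from one integration by parts and the fact $P_0(ff_x) = 0$ applied to $f = v_x$.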

The first term of the right-hand side is roughly $M'_B(\tilde v)(6B(B(\tilde v^2)\tilde v_x))$. Indeed, a 
computation shows 

$M'_B(\tilde v)(6B(B(\tilde v^2)\tilde v_x)) = 6B(2\tilde v_xB(\tilde v\tilde v_x) + B(\tilde v^2)\tilde v_{xx}) + 12B(1 - P_0)(\tilde vB(B(\tilde v^2)\tilde v_x)).$
Thus we have 

$M'_B(\tilde v)(\tilde v_t + \tilde v_{xxx} - 6B(B(\tilde v^2)\tilde v_x) + 6B(P_0(\tilde v^2)\tilde v_x)) = 12E_1 + 6E_2$
where the error terms $E_1$, $E_2$ are the "commutator expressions" 
$E_1 := B(B(\tilde v^2)B(\tilde v\tilde v_x) - (1 - P_0)(\tilde vB(B(\tilde v^2)\tilde v_x)))$
$E_2 := P_0(\tilde v^2)[M'_B(\tilde v), B]\tilde v_x.$
Thus $\tilde v$ obeys the equation 

(6.8) $\tilde v_t + \tilde v_{xxx} = 6B((B - P_0)(\tilde v^2)\tilde v_x) + M'_B(\tilde v)^{-1}(12E_1 + 6E_2); \quad \tilde v(0) = \tilde v_0.$

We have written $S_{BmKdV}(t)$ in figure (6.1) to represent this flow. Since $\tilde v$ is smooth, 
it is a priori in the space $Y^{s+1}$ when restricted to the interval $[-T, T]$. We now 
seek to control the non-linear terms in (6.8). 

If it were not for the error terms $E_1$, $E_2$, one could obtain bounds of the form 

(6.9) $\|\tilde v\|_{Y^{s+1}} \lesssim 1$

from (6.7) and the local well-posedness theory for mKdV in [9] (which can easily 
handle the presence of the order 0 operator $B$). To deal with the $E_1$, $E_2$ terms we 
use the following estimate. 

Lemma 6.3. We have 

(6.10) $\|M'_B(\tilde v)(t)^{-1}E_j\|_{Z^{s+1}} \leq CN^{-\sigma}$
for $j = 1, 2$, and $t \in [-T, T]$. 

Proof. By (3.3) and Lemma 6.1 (using (6.7), of course) it suffices to show that 
(6.11) $\|E_j\|_{L_t^{\infty}H_x^s} \lesssim N^{-\sigma}.$

We first prove this for $E_1$. Observe that $B(\tilde v^2)B(\tilde v\tilde v_x) = \partial_x\frac14(B(\tilde v^2))^2$ has mean 
zero, and so we can factor out a $(1 - P_0)$, and reduce to showing that 

$\|wB(\tilde v\tilde v_x) - \tilde vB(w\tilde v_x)\|_{H_x^s} \lesssim N^{-\sigma}$

where we have used the shorthand $w := B(\tilde v^2)$. 

By (2.2) we see that w is bounded in H^ +(y for some a > 0. From the identity 

$wB(\tilde v\tilde v_x) - \tilde vB(w\tilde v_x) = w[B, \tilde v]\tilde v_x - \tilde v[B, w]\tilde v_x$

and another application of (2.2), we see that it suffices to show the commutator 
estimate 

(6-12) ll[B,/].9ll^<^- ff/2 ||/||^ll.9ll^. 

Without loss of generality we may assume that $f$ and $g$ have non-negative Fourier 
transform. Observe that 

$\widehat{[B, f]g}(k) = \sum_{k_1 + k_2 = k} (b(k) - b(k_2))\hat f(k_1)\hat g(k_2).$

The quantity $b(k) - b(k_2)$ is clearly $O(1)$. If $|k_1| \ll N$ then one also obtains a bound 
of $O(|k_1|/N)$ by the mean-value theorem. Thus we have a universal bound of 

$|b(k) - b(k_2)| \lesssim |k_1|^{\sigma/2}N^{-\sigma/2}.$

The commutator estimate then reduces to 

\\(\d x r /2 f)g\\H,<\\f\\ Hl+ 49\\H„ 

but this follows from (2.2). 
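The "universal bound" on the symbol difference in the proof of (6.12) is an interpolation between the trivial $O(1)$ bound and the mean-value bound. Spelled out, for any exponent $0 \leq \theta \leq 1$ (here $\theta = \sigma/2$, which is an assumption that $\sigma \leq 2$):

```latex
\[
|b(k) - b(k_2)| \;\lesssim\; \min\Bigl(1, \frac{|k_1|}{N}\Bigr)
\;\leq\; \Bigl(\frac{|k_1|}{N}\Bigr)^{\theta},
\qquad \theta = \sigma/2,
\]
since $\min(1, x) \le x^{\theta}$ for every $x \ge 0$ and $0 \le \theta \le 1$:
if $x \le 1$ then $x \le x^{\theta}$, and if $x > 1$ then $1 < x^{\theta}$.
```

The factor $|k_1|^{\sigma/2}$ is then absorbed as $\sigma/2$ derivatives falling on $f$, which is how the commutator estimate reduces to a product estimate.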
Now we prove (6.11) for $E_2$. From (6.7) we see that $P_0(\tilde v^2)$ is bounded in time, 
so it suffices to show that 

$\|[M'_B(\tilde v), B]\tilde v_x\|_{L_t^{\infty}H_x^s} \lesssim N^{-\sigma}.$

Since $[\partial_x, B] = 0$, we have 

$[M'_B(\tilde v), B]\tilde v_x = 2B(1 - P_0)(\tilde vB\tilde v_x) - 2B^2(1 - P_0)(\tilde v\tilde v_x) = 2B(1 - P_0)[\tilde v, B]\tilde v_x,$
and the claim follows from (6.12). □ 

From this lemma and perturbation theory in the $Y^{s+1}$ spaces (using the local 
well-posedness theory in [9]) we thus obtain (6.9). 

We now repeat the argument from Section 5. Recall the notation from figure 
(6.1) that $v(t) = S_{mKdV}(t)v_0$. From (1.3), (2.1) and Theorem 2.1 we see that $v(t)$ 
is uniformly bounded in $H^{s+1}$. From the local well-posedness theory for mKdV we 
thus have 

$\|v\|_{Y^{s+1}} \lesssim 1.$

From this and (6.9), we may find an interval $[M, M + N^{1/4}] \subset [N^{1/2}, 2N^{1/2}]$ such 
that 

$\|(P_{\leq M + N^{1/4}} - P_{\leq M})v\|_{Y^{s+1}} + \|(P_{\leq M + N^{1/4}} - P_{\leq M})\tilde v\|_{Y^{s+1}} \lesssim N^{-\sigma}.$
Fix this $M$. Set 

$v_{lo} := P_{\leq M}v$ and $\tilde v_{lo}(t) := P_{\leq M}\tilde v$. 
By arguing as in the previous section we see that $v_{lo}$ obeys the equation 

(6.13) $(\partial_t + \partial_{xxx})v_{lo} = F_0(v_{lo}, v_{lo}, v_{lo}) + F_{\neq 0}(v_{lo}, v_{lo}, v_{lo}) + \text{error terms}$

where the error terms have a $Z^{s+1}$ norm of $O(N^{-\sigma})$. We now claim that $\tilde v_{lo}$ obeys 
the same equation (but with a different set of error terms, of course). Assuming 
this claim for the moment, note that $v_{lo}$ and $\tilde v_{lo}$ have the same initial data, so we 
obtain 

(6.14) $\sup_{|t| \leq T} \|v_{lo}(t) - \tilde v_{lo}(t)\|_{H^{s+1}} \lesssim N^{-\sigma}$

by perturbation theory. The bound (6.14) implies our goal (6.2) relatively quickly: 
apply the Miura transform $M$ (see (1.8)) to the difference on the left side of (6.14), 
and use the commutator bound (5.2), the fact that $P_{\leq M}M = P_{\leq M}M_B$, and $M \geq N^{1/2}$, 
to conclude that 

(6.15) $N^{-\sigma} \gtrsim \|P_{\leq M}M_B\tilde v(t) - P_{\leq M}Mv(t)\|_{H^s} \gtrsim \|P_{\leq N^{1/2}}\tilde u(t) - P_{\leq N^{1/2}}u(t)\|_{H^s},$
as desired (see (6.2)). 

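It remains to show that $\tilde v_{lo}$ verifies (6.13). Applying $P_{\leq M}$ to (6.8) and using 
Lemma 6.3 we have 

$(\partial_t + \partial_{xxx})\tilde v_{lo} = 6P_{\leq M}((B - P_0)(\tilde v^2)\tilde v_x) + \text{error}.$

By repeating the argument in Section 5 we have 

$6P_{\leq M}((1 - P_0)(\tilde v^2)\tilde v_x) = P_{\leq M}(F_0(\tilde v, \tilde v, \tilde v) + F_{\neq 0}(\tilde v, \tilde v, \tilde v)) = F_0(\tilde v_{lo}, \tilde v_{lo}, \tilde v_{lo}) + F_{\neq 0}(\tilde v_{lo}, \tilde v_{lo}, \tilde v_{lo}) + \text{error terms}.$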
Thus it will suffice to show that 

(6.16) $P_{\leq M}((1 - B)(\tilde v^2)\tilde v_x) = \text{error terms}.$

For a fixed time $t$, the spatial Fourier coefficient of the left-hand side at $(k, t)$ is 
$\sum_{k = k_1 + k_2 + k_3} \chi_{[-M, M]}(k)(1 - b(k_1 + k_2))\widehat{\tilde v}(k_1, t)\widehat{\tilde v}(k_2, t)ik_3\widehat{\tilde v}(k_3, t).$

The summand vanishes unless $|k| \leq M \lesssim N^{1/2}$ and $|k_1 + k_2| \gtrsim N$, which forces 

$|k_3| \gtrsim N.$

First consider the contributions of the case when $(k_1 + k_2)(k_2 + k_3)(k_1 + k_3) \neq 0$. 
We now apply (4.7). By our previous discussion we have $N_0 \lesssim N^{1/2}$ and $N_{soprano} \gtrsim 
N$, hence we see from (6.9) (writing things in terms of spacetime Fourier transforms 
instead of spatial Fourier transforms, taking absolute values and discarding the 
$1 - b(k_1 + k_2)$ factor) that this contribution is error. 

It remains to consider the case when $(k_1 + k_2)(k_2 + k_3)(k_1 + k_3) = 0$. By the 
previous discussion $k_1 + k_2$ cannot be zero, while $|k_3|$ is much larger than $|k|$. Thus 
the only two cases are when $(k_1, k_2, k_3)$ is equal to $(k, -k_3, k_3)$ or $(-k_3, k, k_3)$, so 
by symmetry the total contribution to the Fourier coefficient is 

$2\chi_{[-M, M]}(k)\sum_{|k_3| \gtrsim N} ik_3(1 - b(k_3 - k))\widehat{\tilde v}(k, t)\widehat{\tilde v}(-k_3, t)\widehat{\tilde v}(k_3, t).$

Combining the $k_3$ term with the $-k_3$ term, this becomes 

$2\chi_{[-M, M]}(k)\sum_{k_3 \gtrsim N} ik_3(b(-k_3 - k) - b(k_3 - k))\widehat{\tilde v}(k, t)\widehat{\tilde v}(-k_3, t)\widehat{\tilde v}(k_3, t).$

By the mean-value theorem and the fact that $b$ is even, we have 

$b(-k_3 - k) - b(k_3 - k) = O(|k|/N) = O(N^{-\sigma}).$
Meanwhile, we have 

$\sum_{k_3 \gtrsim N} |k_3||\widehat{\tilde v}(-k_3, t)||\widehat{\tilde v}(k_3, t)| \lesssim \|\tilde v(t)\|_{H^{1/2}}^2 \lesssim 1.$

Thus the above Fourier coefficient is $O(N^{-\sigma}|\widehat{\tilde v}(k, t)|)$. By (6.7) we thus see that 
this contribution to (6.16) has an $L_t^{\infty}H_x^{s+1}$ norm of $O(N^{-\sigma})$. By (3.3) we thus see 
that this contribution is error as desired. This completes the proof of (6.13) and 
hence (6.2). This concludes the proof of Theorem 1.2. 
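The step combining evenness of $b$ with the mean-value theorem in the resonant case can be written out as follows. This is a sketch under the assumption, standard for a symbol adapted to $[-N, N]$ but not restated in this section, that $|b'| \lesssim 1/N$.

```latex
\begin{align*}
b(-k_3 - k) - b(k_3 - k)
  &= b(k_3 + k) - b(k_3 - k)            && \text{($b$ is even)} \\
  &= 2k\, b'(\xi) \quad \text{for some } \xi \text{ between } k_3 - k \text{ and } k_3 + k
                                         && \text{(mean-value theorem)} \\
  &= O\bigl(|k|/N\bigr) = O\bigl(N^{-1/2}\bigr)
                                         && \text{(since } |k| \le M \lesssim N^{1/2}\text{)}.
\end{align*}
```

Without the pairing of the $k_3$ and $-k_3$ terms one would only have the $O(1)$ bound on $1 - b(k_3 - k)$, which gives no decay in $N$.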

7. Proof of Theorem 1.5: Symplectic nonsqueezing of KdV 

Let $N \gg 1$, and let $b$ be a symbol adapted to $[-N, N]$ which equals one on 
$[-N/2, N/2]$, and let $B$ be the associated Fourier multiplier. We begin by considering 
the modified Hamiltonian $H_N$ on $P_{\leq N}H_0^{-1/2}(\mathbb{T})$, defined by 

$H_N(u) := \int_{\mathbb{T}} -\frac12u_x^2 - (Bu)^3\, dx.$
We compute the Hamiltonian flow on $P_{\leq N}H_0^{-1/2}$ corresponding to $H_N$. Fix $u, v \in 
P_{\leq N}H_0^{-1/2}$. We see that 

$\frac{d}{d\varepsilon}H_N(u + \varepsilon v)|_{\varepsilon = 0} = \int_{\mathbb{T}} -u_xv_x - 3(Bu)^2Bv\, dx$

$= \{-u_{xxx} + 6B((Bu)(Bu_x)), v\}.$

Since $-u_{xxx} + 6B((Bu)(Bu_x))$ is in $P_{\leq N}H_0^{-1/2}$, we conclude as in (1.15), (1.16) 
that the Hamiltonian flow of $H_N$ on $P_{\leq N}H_0^{-1/2}$ is given by 

(7.1) $u_t + u_{xxx} = 6B((Bu)(Bu_x)); \quad u(0) = u_0 \in P_{\leq N}H_0^{-1/2}(\mathbb{T}).$

Let $S^{(N)}_{KdV}(t)$ denote the flow map associated to this equation; for each $t$, we observe 
that $S^{(N)}_{KdV}(t)$ is thus a symplectomorphism on the finite-dimensional symplectic 
vector space $P_{\leq N}H_0^{-1/2}$. In particular, it obeys Theorem 1.7 (that is, we take the map 
in that theorem to be $S^{(N)}_{KdV}(t)$). To conclude the proof of Theorem 1.5 it thus suffices to show 
that the flow $S^{(N)}_{KdV}(t)$ obeys the weak approximation property in Condition 1.8: 

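Proposition 7.1. Let $k_0 \in \mathbb{Z}^*$, $T > 0$, $A > 0$, $0 < \varepsilon \ll 1$. Then there exists an 
$N_0 = N_0(k_0, T, \varepsilon, A) > |k_0|$ such that 

$|k_0|^{-1/2}|\widehat{S_{KdV}(T)u_0}(k_0) - \widehat{S^{(N)}_{KdV}(T)u_0}(k_0)| \lesssim \varepsilon$
for all $N \geq N_0$ and all $u_0 \in B^N(0; A)$ (see (1.25) for the definition of this ball). 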

Proof. We make the transformation $w := Bu$, where $u$ solves (7.1). Applying $B$ to 
(7.1) we obtain 

$w_t + w_{xxx} = 6B^2(ww_x); \quad w(0) = Bu_0$
which is (1.7) with $B$ replaced by $B^2$. Thus we have the intertwining relationship 
described by (1.27) in the introduction to this paper, 

$BS^{(N)}_{KdV}(t)u_0 = S_{B^2KdV}(t)Bu_0.$
In particular, if $N_0 \gg |k_0|$, then $b(k_0) = 1$, so we have 

(7.2) $\widehat{S^{(N)}_{KdV}(T)u_0}(k_0) = \widehat{S_{B^2KdV}(T)Bu_0}(k_0).$

From Theorem 1.3 we have 

(7.3) $|k_0|^{-1/2}|\widehat{S_{KdV}(T)u_0}(k_0) - \widehat{S_{KdV}(T)Bu_0}(k_0)| \lesssim N^{-\sigma}.$
From Theorem 1.2 we have (if $N$ is large enough, $N_0 \gg k_0$) 

(7.4) $|k_0|^{-1/2}|\widehat{S_{KdV}(T)Bu_0}(k_0) - \widehat{S_{B^2KdV}(T)Bu_0}(k_0)| \lesssim N^{-\sigma},$

where the implicit constants are allowed to depend on $T$ and $A$. By (7.2), the 
second term on the left of (7.4) is the same as $\widehat{S^{(N)}_{KdV}(T)u_0}(k_0)$. Combining this 
observation with (7.3), (7.4), and the triangle inequality, we obtain the desired 
claim, if $N_0$ is sufficiently large depending on $k_0$, $T$, $\varepsilon$, $A$. □ 



The proof of Theorem 1.5 is now complete. 
8. Proof of Theorem 1.1: $P_{\leq N}$KdV does not approximate KdV 

Informally, the point of this section is that there is absolutely no slack in the 
bilinear estimate (1.4) at regularity $s = -1/2$ no matter what the frequencies of 
the various functions are; see the examples in [18]. But to convert the examples 
for the bilinear estimate to quantitative estimates of the KdV and truncated KdV 
flow, and in particular to establish that the two flows differ as claimed in Theorem 1.1, 
we must do some tedious computation of iterates, which we detail below. 

Fix $k_0$, $A$, $T$, for instance $T, A \sim 1$; our implicit constants in this section will be 
allowed to depend on these parameters. Without loss of generality we may assume 
that $k_0 > 0$. We let $0 < \sigma \ll 1$ be a small parameter depending on $k_0$, $A$, $T$ to be 
chosen later. 

Let $N \gg \sigma^{-100}$ be a large integer. We consider the initial data 
$u_0(x) := \sigma^3\cos(k_0x) + \sigma N^{1/2}\cos(Nx).$

Note that $u_0$ lies in $P_{\leq N}H_0^{-1/2}(\mathbb{T})$ with norm $O(\sigma)$, and in particular we have 
$u_0 \in B^N(0; A)$ if $\sigma \ll 1$ is sufficiently small. 
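The claim that the initial data has norm $O(\sigma)$ uniformly in $N$ can be checked numerically. The sketch below is an illustration only: it uses the inhomogeneous weight $\langle k\rangle^{2s}$ with $s = -1/2$ and the $(2\pi)^{1/2}$ normalization from the introduction (the mean-zero space may be normed slightly differently), and the function names are hypothetical.

```python
import math


def h_minus_half_norm(coeffs):
    """H^{-1/2} norm of a function given by its Fourier coefficients.

    `coeffs` maps an integer frequency k to the coefficient u_hat(k).
    """
    total = sum((1 + k * k) ** -0.5 * abs(c) ** 2 for k, c in coeffs.items())
    return math.sqrt(2 * math.pi * total)


def initial_data_coeffs(sigma, k0, N):
    """Coefficients of sigma^3 cos(k0 x) + sigma N^{1/2} cos(N x)."""
    # cos(kx) = (e^{ikx} + e^{-ikx}) / 2, so each mode carries half the amplitude
    low, high = sigma ** 3 / 2, sigma * math.sqrt(N) / 2
    return {k0: low, -k0: low, N: high, -N: high}


sigma, k0 = 0.1, 3
norms = [h_minus_half_norm(initial_data_coeffs(sigma, k0, N))
         for N in (10 ** 3, 10 ** 6)]
ratios = [n / sigma for n in norms]
```

The ratios stay close to $\sqrt{\pi}$ for both values of $N$: the amplitude $N^{1/2}$ of the high mode is exactly compensated by the weight $\langle N\rangle^{-1/2}$, which is why this data sits at the critical regularity. (The values of $N$ here are for illustration and do not satisfy $N \gg \sigma^{-100}$.)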

Let $u$ and $u^{(N)}$ be the solutions to the KdV flow (1.1) and truncated KdV flow 
(1.5) respectively, with initial data $u(0) = u^{(N)}(0) = u_0$. We shall show that, if $\sigma$ 
is sufficiently small, 

(8.1) $|\hat u(T)(k_0) - \widehat{u^{(N)}}(T)(k_0)| \sim \sigma^5,$
which gives (1.6). 

To prove (8.1) we need good approximations of $u$ and $u^{(N)}$. To approximate $u$, 
we look at the iterates $u^{[j]}$ for $j = 0, 1, 2, \ldots$ defined inductively by $u^{[-1]}(t, x) = 0$ 
and 

(8.2) $(\partial_t + \partial_{xxx})u^{[j]} = \partial_x(3(u^{[j-1]})^2); \quad u^{[j]}(0) = u_0.$

From the contraction mapping arguments in [18] (see also [9]) we know that the 
$u^{[j]}$ converge to $u$ in the $Y^{-1/2}$ norm; indeed each iterate is closer to $u$ by a factor 
of at least $O(\sigma)$ compared to the previous one$^{23}$. A routine calculation yields 

$u^{[0]}(t, x) = \sigma^3\cos(k_0x + k_0^3t) + \sigma N^{1/2}\cos(Nx + N^3t),$

and thus 

d x (3(uM) 2 ) = -'^C7 4 N 3 / 2 sin((N+k )x+(N 3 +k 3 )t)-^N^ 2 sm((N-k a )x+(N 3 -k 3 )t)+O z (a 6 ) 



Strictly speaking, this contraction mapping property was only proven for $T$ sufficiently small, 
but by subdividing $[0, T]$ into a finite number of small intervals one can obtain the same contraction 
mapping for arbitrary $T$ if $\sigma$ is sufficiently small depending on $T$. This naive argument requires 
$\sigma \lesssim e^{-CT}$ for some $C$; the more sophisticated scaling argument in [9] can improve this to 
$\sigma \ll T^{-1/3-}$, but we will not need this quantitative improvement for our arguments here. 
where $O_Z(K)$ denotes a quantity with a $Z^{-1/2}$ norm of $O(K)$ (note that we have 
used the hypothesis $N \gg \sigma^{-100}$ to absorb several terms into this $O_Z(\sigma^6)$ error$^{24}$). 



Observe that 



(d t +d xxx ) (-^N^cosdN + k )x + (N 3 + fcg)i) - cos((7V + k )x + (N + k ) 3 t)}) 
= -^a 4 N 3 ' 2 k sm((N + k )x + (N 3 + k 3 )t) + O z (a 6 ) 

and 

(d t +d xxx ) Qa 4 iV- 1 / 2 (cos((iV - k )x + (N 3 - k 3 )t) - cos((N - k )x + (N - k Q ) 3 t)) 

= -^o A N 3 ' 2 k a sin((7V - k a )x + (N 3 - k 3 )t) + O z (<J 6 ). 
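The two "Observe that" computations rest on one elementary identity for the Airy operator applied to a travelling cosine. It is recorded here as a worked equation; the symbols $\kappa$ and $\omega$ are introduced only for this illustration.

```latex
\[
(\partial_t + \partial_{xxx}) \cos(\kappa x + \omega t)
  = (\kappa^3 - \omega)\, \sin(\kappa x + \omega t).
\]
In particular the right-hand side vanishes when $\omega = \kappa^3$ (free solutions), while for
$\kappa = N \pm k_0$ and $\omega = N^3 \pm k_0^3$ one has
\[
\kappa^3 - \omega = (N \pm k_0)^3 - (N^3 \pm k_0^3) = \pm 3 N k_0 (N \pm k_0),
\]
which is of size $N^2$ for fixed $k_0$.
```

This is the source of the gain of roughly $N^{-2}$ when inverting the linear operator on the modes $N \pm k_0$, and of the stated distance of about $N^2$ between these modes and the cubic in footnote 24.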
Combining this with the calculation of $\partial_x(3(u^{[0]})^2)$ above and using (3.5) we obtain 

vP(t,x) = U M(t,x) 

+ (^-ia 4 iV- 1 / 2 fc - 1 (cos((iV + k )x + (N 3 + k 3 )t) - cos((N + k )x + (N + k ft)) 
icr 4 Ar- 1 / 2 fc- 1 (cos((iV - k )x + (N 3 - k 3 )t) - cos((iV - k )x + {N - k ft)) 

where $O_Y(\sigma^6)$ denotes a quantity with a $Y^{-1/2}$ norm of $O(\sigma^6)$. In fact, since the 
$\cos((N \pm k_0)x + (N \pm k_0)^3t)$ terms are already $O_Y(\sigma^6)$ we have 

U M (t, x) = u [°] (t, a;)+icr 4 iV^ 1 / 2 fc ( 7 1 (cos((7V - k Q )x + (N 3 - k 3 )t) ~ cos((N + k )x + (N 3 + k 3 )t)) +Oy (ct 6 ). 

Using (1.4) to handle any interaction with a factor of $\sigma^6$ or better, we obtain 
(8.3) $\partial_x(3(u^{[1]})^2) = \partial_x(3(u^{[0]})^2) + O_Z(\sigma^6).$

Note that there are two additional, potentially disruptive terms, equal to plus and minus the 
same constant multiple of $\sigma^5\sin(k_0x + k_0^3t)$, which appear in the expansion of $\partial_x(3(u^{[1]})^2)$, 
but they have opposite signs and so cancel$^{25}$ each other. From (8.3) and (3.5) we have 

$u^{[2]} = u^{[1]} + O_Y(\sigma^6).$

From the contraction mapping property of the iteration map we thus have 

$u = u^{[1]} + O_Y(\sigma^6).$



24 For example, the term $\sigma^4N^{1/2}k_0\sin((N + k_0)x + (N^3 + k_0^3)t)$ which appears when one calculates 
$\partial_x(3(u^{[0]})^2)$ is $O_Z(\sigma^6)$, as the space-time Fourier transform of this term is supported a distance 
approximately $N^2$ from the cubic $\tau = \xi^3$. Hence when computing the $Z^{-1/2}$ norm of this term, we 
get a factor of $N^{-1} \ll \sigma^{100}$ from the denominator in the definition of this norm. 

25 This special cancellation seems to be what distinguishes the KdV flow (1.1) from superficially 
similar flows such as (1.5), and is crucial to obtaining our high-frequency and low-frequency 
approximation results for this flow. It is instructive to see this cancellation via the renormalized 
mKdV flow (1.9) by computing iterates for mKdV and then applying the Miura transform to 
those iterates. 
In particular we see that 

(8.4) $\hat u(T)(k_0) = \widehat{u^{[1]}}(T)(k_0) + O(\sigma^6) = \widehat{u^{[0]}}(T)(k_0) + O(\sigma^6).$

Now we approximate $u^{(N)}$. To do this we construct iterates $\tilde u^{[j]}$, $j = 0, 1, 2, \ldots$ 
for the truncated equation (1.5) in the same way as in (8.2), starting from $\tilde u^{[-1]} := 0$. 

By a variant of the local well-posedness theory from [18] (and [9]) we know that 
$\tilde u^{[j]}$ will converge to $u^{(N)}$ in the $Y^{-1/2}$ norm. By reviewing the computation of $u^{[1]}(t, x)$, 
but now bearing in mind the presence of the projection $P_{\leq N}$, we obtain for the first 
iterate, 

u [1] (t, x) = u [a] (t, x) + ^a 4 N~^ cos((A - k )x + (N 3 - k 3 )t)) + Y (<t 6 ) 

= u^(x,t) + Y (a 6 ). 

Comparing this with the formula for $u^{[1]}$ above, we note that the Fourier modes at 
$\pm(N + k_0)$ are not present here. As a consequence, the analog of (8.3) reads, 

X (3(«W) 2 ) = d x (3(uW) 2 ) + sin(k x + k 3 t) + O z {a 6 ). 
Since $(\partial_t + \partial_{xxx})(t\sin(k_0x + k_0^3t)) = \sin(k_0x + k_0^3t)$, we can write, 

u [2] = u [1] + ^a 5 tsin(k x + fcgt) + Oy(<r 6 ). 
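The identity used to solve for the secular term, and the Fourier coefficient it produces at frequency $k_0$, can be verified directly. In the sketch below $\theta$ is shorthand introduced here for $k_0x + k_0^3t$, and the Fourier coefficient is taken with the normalization (1.2).

```latex
\begin{align*}
\partial_t\bigl(t \sin\theta\bigr) &= \sin\theta + t\,k_0^3 \cos\theta, \\
\partial_{xxx}\bigl(t \sin\theta\bigr) &= -\,t\,k_0^3 \cos\theta, \\
(\partial_t + \partial_{xxx})\bigl(t \sin\theta\bigr) &= \sin\theta,
\qquad \theta := k_0 x + k_0^3 t .
\end{align*}
Moreover, since $\sin\theta = \frac{1}{2i}\bigl(e^{i\theta} - e^{-i\theta}\bigr)$ and $k_0 > 0$,
\[
\widehat{\,t\sin\theta\,}(k_0)\Big|_{t = T} = \frac{T}{2i}\, e^{i k_0^3 T} = -\frac{i}{2}\, T\, e^{i k_0^3 T}.
\]
```

The linear growth in $t$ is what makes the uncancelled $\sigma^5$ term visible at time $T$: a resonant forcing of size $\sigma^5$ produces a response of size $T\sigma^5$ at the single mode $k_0$.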
We can easily check then that 

$\partial_x(3(\tilde u^{[2]})^2) = \partial_x(3(\tilde u^{[1]})^2) + O_Z(\sigma^6),$

hence $\tilde u^{[3]} = \tilde u^{[2]} + O_Y(\sigma^6)$, which by the contraction mapping property implies 
that 

$u^{(N)} = \tilde u^{[2]} + O_Y(\sigma^6).$

In particular we see that 

uW(T)(fco) - fiM(T)(fco) + 0(a 6 ) = wW(T)(fc ) - |*T<7 5 e ifc o T + 0{a 6 ). 
Comparing this with (8.4) we obtain (8.1) as desired. This proves Theorem 1.1. 

9. Appendix. Proof of (6.3): $H^s$ bound for the BKdV flow 

We now prove the bound (6.3) for $H_0^s$ solutions to the KdV-like equation 

$u_t + u_{xxx} = 6B(uu_x); \quad u(0) = u_0$

with $\|u_0\|_{H_0^s} \lesssim 1$; this bound is needed to complete the proof of Theorem 1.2 and 
hence Theorem 1.5. 

If $s \geq 0$ then this bound follows from $L^2$ conservation and standard persistence 
of regularity theory (see e.g. [1]), so we shall assume that $-1/2 \leq s < 0$. 
To do so, let us first review (from [8]) how the corresponding bound (1.3) was 
proven for the KdV flow 

$u_t + u_{xxx} = 6uu_x; \quad u(0) = u_0.$

9.1. Review of proof of $H^s$ bound for KdV (1.3). The idea is to modify the 
conserved $L^2$ norm $\int u^2$ to something resembling the $H^s$ norm and which is still 
approximately conserved. To do this it is convenient to introduce some notation 
for multilinear forms. 

If $n \geq 2$ is an integer, we define a (spatial) $n$-multiplier to be any function 
$M_n(k_1, \ldots, k_n)$ on the (discrete) hyperplane 

$\Gamma_n := \{(k_1, \ldots, k_n) \in \mathbb{Z}^n : k_1 + \ldots + k_n = 0\}.$

If $M_n$ is an $n$-multiplier and $u_1, \ldots, u_n$ are functions on $\mathbb{R}/2\pi\mathbb{Z}$, we define the 
$n$-linear functional $\Lambda_n(M_n; u_1, \ldots, u_n)$ by 

$\Lambda_n(M_n; u_1, \ldots, u_n) := \sum_{(k_1, \ldots, k_n) \in \Gamma_n} M_n(k_1, \ldots, k_n)\prod_{j=1}^n \hat u_j(k_j).$

We adopt the notation 

$\Lambda_n(M_n; u) := \Lambda_n(M_n; u, \ldots, u).$

Observe that $\Lambda_n(M_n; u)$ is invariant under permutations of the $k_j$ indices. In particular 
we have 

$\Lambda_n(M_n; u) = \Lambda_n([M_n]_{sym}; u)$

where 

(9.1) $[M_n]_{sym}(k) := \frac{1}{n!}\sum_{\sigma \in S_n} M_n(\sigma(k))$

is the symmetrization of $M_n$. 
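The symmetrization (9.1) is easy to compute by brute force for small $n$, which is a convenient way to check the multiplier manipulations in this appendix. The sketch below is an illustration only; the function name `symmetrize` is hypothetical and multipliers are represented as Python functions of a frequency tuple.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial


def symmetrize(multiplier, k):
    """Average `multiplier` over all permutations of the frequency tuple k."""
    n = len(k)
    total = sum(Fraction(multiplier(p)) for p in permutations(k))
    return total / factorial(n)


# The 3-multiplier k2 + k3 symmetrizes to (2/3)(k1 + k2 + k3) ...
off_hyperplane = symmetrize(lambda p: p[1] + p[2], (1, 2, 4))
# ... and therefore vanishes on the hyperplane k1 + k2 + k3 = 0.
on_hyperplane = symmetrize(lambda p: p[1] + p[2], (5, -2, -3))
```

Here `off_hyperplane` equals $\frac{2}{3}(1 + 2 + 4) = \frac{14}{3}$ and `on_hyperplane` equals $0$. The second computation is the mechanism behind $L^2$ conservation: a multiplier that symmetrizes to a multiple of $k_1 + k_2 + k_3$ contributes nothing to $\Lambda_3$.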

Thus, for instance, we have $\int u^2 = 2\pi\Lambda_2(1; u)$, and more generally $\|u\|_{\dot H^s}^2 = 
2\pi\Lambda_2(|k_1|^s|k_2|^s; u) = 2\pi\Lambda_2(|k_1|^{2s}; u)$ for $u \in H_0^s$. 

Now suppose that $u$ obeys the KdV evolution (1.1), and $M_n$ is a symmetric 
multiplier. Then we have the differentiation law 

(9.2) $\frac{d}{dt}\Lambda_n(M_n; u(t)) = \Lambda_n(M_n\alpha_n; u(t)) - 3in\Lambda_{n+1}(M_n(k_1, \ldots, k_{n-1}, k_n + k_{n+1})(k_n + k_{n+1}); u(t))$
where 

$\alpha_n := k_1^3 + \ldots + k_n^3$

(see [8]). Thus for instance we have 

$\frac{d}{dt}\Lambda_2(1; u(t)) = \Lambda_2(\alpha_2; u(t)) - 6i\Lambda_3(k_2 + k_3; u(t))$

$= \Lambda_2(k_1^3 + k_2^3; u(t)) - 4i\Lambda_3(k_1 + k_2 + k_3; u(t))$
$= 0 - 0,$
demonstrating the conservation of the $L^2$ norm. 

Henceforth we shall omit the $u(t)$ from the $\Lambda_n$ notation for brevity. We also 
adopt the convenient notation that $k_{ij} := k_i + k_j$, etc., thus for instance $k_{145} = 
k_1 + k_4 + k_5$. Also we write $m_i := m(k_i)$, $m_{ij} := m(k_{ij})$, etc., and $N_i$ for $|k_i|$, $N_{ij}$ 
for $|k_{ij}|$, etc. 

Let $A \gg 1$ be a large number to be chosen later$^{26}$, and let $m(k)$ be a multiplier 
which equals 1 on $[-A, A]$, equals $(|k|/A)^s$ for $|k| > 2A$, and is real, even, and 
smooth in between. We denote the corresponding Fourier multiplier by $I$: 

$\widehat{Iu}(k) := m(k)\hat u(k),$

thus $I$ acts like the identity on frequencies $\leq A$ and is smoothing on frequencies 
$> A$. We define the modified energy $E_2(t)$ by 

$E_2(t) := \Lambda_2(m_1m_2),$

then one can verify that 

$\|u(t)\|_{H^s}^2 \lesssim E_2(t) \lesssim A^{-2s}\|u(t)\|_{H^s}^2.$
From (9.2), (9.1) and the fact that $\alpha_2 = 0$ we have 

$\frac{d}{dt}E_2(t) = -6i\Lambda_3(m_1m_{23}k_{23})$

$= 6i\Lambda_3(m_1^2k_1)$
$= \Lambda_3(M_3)$

where $M_3$ is the 3-multiplier 

$M_3 := 2i(m_1^2k_1 + m_2^2k_2 + m_3^2k_3).$
Now define the modified energy $E_3(t)$ by 

$E_3(t) := E_2(t) + \Lambda_3(\sigma_3)$
where $\sigma_3(k_1, k_2, k_3)$ is the 3-multiplier 

$\sigma_3 := -M_3/\alpha_3.$

This multiplier may appear to be singular at first glance, but we observe that 

(9.3) $\alpha_3 = k_1^3 + k_2^3 + k_3^3 = 3k_1k_2k_3$

and that $M_3$ vanishes whenever $k_1k_2k_3 = 0$. Then by (9.2), (9.1) we have 

$\frac{d}{dt}E_3(t) = \Lambda_3(M_3) + \Lambda_3(\sigma_3\alpha_3) - 9i\Lambda_4(\sigma_3(k_1, k_2, k_{34})k_{34})$

$= \Lambda_4(M_4)$
where $M_4$ is the 4-multiplier 

$M_4 := -9i[\sigma_3(k_1, k_2, k_{34})k_{34}]_{sym}.$
Now define the modified energy $E_4(t)$ by 

$E_4(t) := E_3(t) + \Lambda_4(\sigma_4)$

26 Note that the quantity A here represents what was called N in [8], a notational change 
necessary since in the present paper N represents something else. 






where $\sigma_4(k_1, k_2, k_3, k_4)$ is the 4-multiplier 

$\sigma_4 := -M_4/\alpha_4.$

This multiplier may appear to be singular at first glance, but we observe that 

(9.4) $\alpha_4 = k_1^3 + k_2^3 + k_3^3 + k_4^3 = 3k_{12}k_{13}k_{14}$

(cf. (4.17)), and one can check that $M_4$ vanishes when $k_{12}k_{13}k_{14} = 0$. Then as 
before we have that 

(9.5) $\frac{d}{dt}E_4(t) = \Lambda_5(M_5)$
where 

$M_5 := -12i[\sigma_4(k_1, k_2, k_3, k_{45})k_{45}]_{sym}.$
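The factorizations (9.3) and (9.4), which are what make the divisions by $\alpha_3$ and $\alpha_4$ harmless, are polynomial identities on the hyperplanes $\Gamma_3$ and $\Gamma_4$. A brute-force check over a small range of integer frequencies (a sanity check, not a proof) can be sketched as follows; the helper name `alpha` is hypothetical.

```python
from itertools import product


def alpha(k):
    """Sum of cubes k_1^3 + ... + k_n^3 of a frequency tuple."""
    return sum(x ** 3 for x in k)


R = range(-6, 7)

# (9.3): on k1 + k2 + k3 = 0, alpha_3 = 3 k1 k2 k3.
ok3 = all(
    alpha((a, b, -a - b)) == 3 * a * b * (-a - b)
    for a, b in product(R, R)
)

# (9.4): on k1 + k2 + k3 + k4 = 0, alpha_4 = 3 k12 k13 k14 with kij = ki + kj.
ok4 = all(
    alpha((a, b, c, -a - b - c)) == 3 * (a + b) * (a + c) * (a + (-a - b - c))
    for a, b, c in product(R, R, R)
)
```

Both flags come out true. The zero set of $\alpha_4$ is exactly where two of the frequencies cancel, which is the set on which $M_4$ must also vanish for $\sigma_4$ to be well defined.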

We could continue this procedure indefinitely, but $E_4$ will turn out to be a suitable 
almost conserved quantity for our purposes. In [8] it was shown (by Gagliardo-Nirenberg 
type arguments) that $E_4$ is bounded if and only if $\|u\|_{H^s}$ is bounded, so 
to obtain (1.3) it suffices to control $E_4(t)$. In light of (9.5) it will suffice to control 
$M_5$. The key lemma here was the following: 

Lemma 9.1. [8] Let $k_1, k_2, k_3, k_4, k_5$ be real numbers (not necessarily integer) such 
that $k_{12345} = 0$. Then $M_5(k_1, \ldots, k_5)$ vanishes when $N_1, \ldots, N_5 \ll A$. In all other 
cases we have the bound 

$|M_5(k_1, \ldots, k_5)| \lesssim \left[\frac{m(N_{*45})^2N_{45}}{(A + N_1)(A + N_2)(A + N_3)(A + N_{45})}\right]_{sym}$

where 

$N_{*45} = \min(N_1, N_2, N_3, N_{45}, N_{12}, N_{13}, N_{23}).$



With this bound and some multilinear $Y^s$ estimates, bounds on the growth 
of $E_4(t)$ were obtained. In particular, if $E_4(T)$ was small for some time $T$, it was 
possible to obtain the bound E^T + 5) = E^T) + 0(A~^~) for some small time 
5^1. Iterating this and using a rescaling argument one could obtain (1.3) for all 
$s \geq -1/2$ (after choosing $A$ appropriately depending on $\|u_0\|_{H^s}$ and $T$). See [8] for 
details. 



9.2. Adapting the argument to the BKdV flow. We now adapt the above 
argument to the flow (1.7). The main difference will be the appearance of various 
quantities of the form $b(k_i)$, $b(k_{ij})$, etc.; however, these factors will play essentially no 
role in the argument. Accordingly, we write $b_i$ for $b(k_i)$, etc. We shall assume that 
the frequency parameter $N$ corresponding to $b$ is much larger than the frequency 
parameter $A$ corresponding to $m$. 



Strictly speaking, in order to handle large data, these estimates had to take place in the 
large period setting $\mathbb{R}/2\pi\lambda\mathbb{Z}$, as one would need to rescale large data to be small. This causes 
some unpleasant technical complications in the arguments, and in particular this is why the $k_j$ in 
the above lemma need to be real (or lie in $\mathbb{Z}^*/\lambda$) rather than integer. See [8], [9] for more details. 
In this paper we will ignore the large period issue as it does not cause any essential change to the 
argument. 
Suppose $u$ solves (1.7). Then (9.2) now becomes 

(9.6) $\frac{d}{dt}\Lambda_n(M_n; u(t)) = \Lambda_n(M_n\alpha_n; u(t)) - 3in\Lambda_{n+1}(M_n(k_1, \ldots, k_{n-1}, k_n + k_{n+1})b(k_n + k_{n+1})(k_n + k_{n+1}); u(t)).$
Again we define 

$E_2(t) := \Lambda_2(m_1m_2),$

then one can verify that 

$\frac{d}{dt}E_2(t) = \Lambda_3(M_3)$

where $M_3$ is the 3-multiplier 

$M_3 := 2i(f_1 + f_2 + f_3)$

and $f(k) := m^2(k)b(k)k$. Observe that $f$ is an odd function with $f'(k) = O(m(k)^2)$ 
and $f''(k) = O(m(k)^2/(A + |k|))$ for all $k$. 

We observe the following bounds on $M_3$. 
Lemma 9.2. If $N_1, N_2, N_3 \ll A$, then $M_3 = 0$. Otherwise, we have 
$|M_3| \lesssim \max(m_1^2, m_2^2, m_3^2)\min(N_1, N_2, N_3).$



Proof. [8] When $N_1, N_2, N_3 \ll A$ then $f_i = k_i$ for $i = 1, 2, 3$, and the claim is 
clear. Otherwise, we use symmetry to assume that $N_1 \sim N_2 \gtrsim N_3$. But then the 
mean-value theorem and the above bounds on $f$ give 

$f_2 = -f_{13} = -f_1 + O(m_1^2N_3),$
and the claim easily follows. □ 

Now define the modified energy $E_3(t)$ by 

$E_3(t) := E_2(t) + \Lambda_3(\sigma_3)$
where $\sigma_3(k_1, k_2, k_3)$ is the 3-multiplier 

$\sigma_3 := -M_3/\alpha_3.$

From Lemma 9.2, (9.3) we see that $\sigma_3$ vanishes when $\max(N_1, N_2, N_3) \ll A$, 
and we have the bounds 

$|\sigma_3| \lesssim \frac{\max(m_1^2, m_2^2, m_3^2)}{(A + \max(N_1, N_2, N_3))^2}$

otherwise (note that the two largest values of $N_j$ have to be comparable). 

By (9.6), (9.1) we have 

$\frac{d}{dt}E_3(t) = \Lambda_4(M_4)$

where $M_4$ is the 4-multiplier 

$M_4 := -9i[\sigma_3(k_1, k_2, k_{34})b_{34}k_{34}]_{sym}.$
Now define the modified energy $E_4(t)$ by 

$E_4(t) := E_3(t) + \Lambda_4(\sigma_4)$
where $\sigma_4(k_1, k_2, k_3, k_4)$ is the 4-multiplier 

$\sigma_4 := -M_4/\alpha_4.$

Then as before we have that 

$\frac{d}{dt}E_4(t) = \Lambda_5(M_5)$

where 

$M_5 := -12i[\sigma_4(k_1, k_2, k_3, k_{45})b_{45}k_{45}]_{sym}.$

Our aim is to show that this new $M_5$ still verifies the bounds in Lemma 9.1; the 
rest of the arguments in [8] will then give the desired bound (6.3) (the presence of 
the $B$ multiplier having no impact on the local well-posedness theory). 

From the definition of $\sigma_4$ and $M_5$, it will suffice to prove the following $M_4$ bound. 
Lemma 9.3. If $\max(N_1, N_2, N_3, N_4) \ll A$ then $M_4$ vanishes. Otherwise, we have 

$|M_4| \lesssim \frac{|\alpha_4|\, m(N_*)^2}{(A + N_1)(A + N_2)(A + N_3)(A + N_4)},$

where $N_* := \min(N_1, N_2, N_3, N_4, N_{12}, N_{13}, N_{14})$. 

Proof. When $\max(N_1, N_2, N_3, N_4) \ll A$ then $\sigma_3(k_1, k_2, k_{34})$ and all of its 
symmetrizations vanish, hence $M_4$ vanishes. Now we assume that $\max(N_1, N_2, N_3, N_4) \gtrsim 
A$. By symmetry we may assume that $N_1 \geq N_2 \geq N_3 \geq N_4$, thus $N_1 \sim N_2 \gtrsim A$. 
From (9.4) we have $|\alpha_4| \sim N_{13}N_{14}N_{34}$. 

We divide into several cases depending on the relative sizes of $N_2$, $N_3$, $N_4$. 

Case 1: $N_2 \gg N_3 \gg N_4$. In this case $|\alpha_4| \sim N_2^2N_3$, thus we reduce to showing 

$|M_4| \lesssim \frac{m(N_*)^2}{A + N_4}.$

But from Lemma 9.2 we have 

$|\sigma_3(k_a, k_b, k_{cd})b_{cd}k_{cd}| \lesssim \frac{\max(m_a, m_b, m_{cd})^2}{A + \max(N_a, N_b, N_{cd})} \lesssim m(N_*)^2/(A + N_4)$

as desired. 

Case 2: $N_2 \sim N_3 \gg N_4$. In this case $|\alpha_4| \sim N_1^3$, thus we reduce to showing 

$|M_4| \lesssim \frac{m(N_*)^2}{A + N_4}.$

One then proceeds as in Case 1. 

Case 3: $N_2 \gg N_3 \sim N_4$. In this case $|\alpha_4| \sim N_1^2 N_{34}$, thus we reduce to showing

\[ |M_4| \lesssim \frac{m(N_*)^2 N_{34}}{(A + N_3)^2}. \]

From Lemma 9.2 we have

\[ |\sigma_3(k_1, k_2, k_{34})\, b_{34}\, k_{34}| \lesssim m(N_*)^2 N_{34}/(A + \max(N_1, N_2, N_{34}))^2 \]

which is acceptable. Similarly

\[ |\sigma_3(k_3, k_4, k_{12})\, b_{12}\, k_{12}| \lesssim m(N_*)^2 N_{12}/(A + \max(N_3, N_4, N_{12}))^2 \]

is acceptable since $N_{12} = N_{34}$. It thus suffices to show that

\[ |\sigma_3(k_1, k_3, k_{24}) b_{24} k_{24} + \sigma_3(k_1, k_4, k_{23}) b_{23} k_{23} + \sigma_3(k_2, k_3, k_{14}) b_{14} k_{14} + \sigma_3(k_2, k_4, k_{13}) b_{13} k_{13}| \lesssim \frac{m(N_*)^2 N_{34}}{(A + N_3)^2}. \]



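We expand out $\sigma_3$ using (9.3), and replace $k_1$ by $-k_{234}$ throughout (using that $f$ is odd and $b$ is even, so that $f_1 = -f_{234}$, $f_{14} = -f_{23}$, $b_{14} = b_{23}$, and similarly for the other terms), and reduce to showing

\[ \left| -\frac{b_{24}(f_3 + f_{24} - f_{234})}{k_{234} k_3} - \frac{b_{23}(f_4 + f_{23} - f_{234})}{k_{234} k_4} + \frac{b_{23}(f_2 + f_3 - f_{23})}{k_2 k_3} + \frac{b_{24}(f_2 + f_4 - f_{24})}{k_2 k_4} \right| \lesssim \frac{m(N_*)^2 N_{34}}{(A + N_3)^2}. \]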

From the mean-value theorem we have $b_{23} = b_2 + O(N_3/N_2) = b_2 + O(N_3/(A + N_3))$. Similarly $b_{24} = b_2 + O(N_3/(A + N_3))$. Let us consider the contribution of the $O(N_3/(A + N_3))$ errors. It will suffice to show that

\[ \frac{f_3 + f_{24} - f_{234}}{k_{234} k_3} - \frac{f_2 + f_4 - f_{24}}{k_2 k_4} \]

and

\[ \frac{f_4 + f_{23} - f_{234}}{k_{234} k_4} - \frac{f_2 + f_3 - f_{23}}{k_2 k_3} \]

are both $O\bigl(\frac{m(N_*)^2 N_{34}}{N_3 (A + N_3)}\bigr)$. By the $k_3 \leftrightarrow k_4$ symmetry it suffices to estimate the former expression. From the mean-value theorem we have

\[ \frac{1}{k_{234} k_3} = \frac{1}{(k_2 + k_{34})(-k_4 + k_{34})} = -\frac{1}{k_2 k_4} + O\Bigl(\frac{N_{34}}{N_2 N_3^2}\Bigr). \]



By Lemma 9.2, the contribution of the error term $O\bigl(\frac{N_{34}}{N_2 N_3^2}\bigr)$ is bounded by



which is acceptable. Thus it suffices to show that 

\[ \frac{f_3 + f_{24} - f_{234}}{k_2 k_4} + \frac{f_2 + f_4 - f_{24}}{k_2 k_4} = O\Bigl(\frac{m(N_*)^2 N_{34}}{N_3 (A + N_3)}\Bigr). \]

But from the mean-value theorem we have

\[ f(k_2) - f(k_{234}) + f(k_3) - f(k_3 - k_{34}) = O(m(N_*)^2 N_{34}), \]

and the claim follows by dividing by $k_2 k_4$.

Case 4: $N_2 \sim N_3 \sim N_4$. Observe this case is essentially symmetric in the indices 1,2,3,4. By definition of $M_4$, $\sigma_3$, $\alpha_3$ we have

\[ |M_4| \sim \Bigl| \Bigl[ \frac{(f_1 + f_2 + f_{34})\, b_{34}}{k_1 k_2} \Bigr]_{sym} \Bigr| \sim N_1^{-4} \bigl| [(f_1 + f_2 + f_{34})\, b_{34}\, k_3 k_4]_{sym} \bigr|. \]

Our task is thus to show that

\[ [(f_1 + f_2 + f_{34})\, b_{34}\, k_3 k_4]_{sym} = O(m(N_*)^2 N_{12} N_{23} N_{13}). \]

Since $b_{34} = b_{12}$, it will suffice by symmetry to show that

\[ (f_1 + f_2 + f_{34}) k_3 k_4 + (f_3 + f_4 + f_{12}) k_1 k_2 = O(m(N_*)^2 N_{12} N_{23} N_{13}). \]

Observe the identity

\[ k_3 k_4 - k_1 k_2 = k_3 k_4 + k_{234} k_2 = k_{23} k_{24} \]

hence we can write the left-hand side as

\[ (f_1 + f_2 + f_3 + f_4) k_1 k_2 + (f_1 + f_2 + f_{34}) k_{23} k_{24} \]

(since $f_{34} = -f_{12}$). By Lemma 9.2, the second term is $O(m(N_*)^2 N_{34} N_{23} N_{24})$ which is acceptable. Thus it will suffice to show that

\[ f_1 + f_2 + f_3 + f_4 = O(m(N_*)^2 N_{12} N_{23} N_{13}/N_1^2). \]
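For the reader's convenience, the two algebraic steps just used can be checked directly. The sketch below uses only the constraint $k_1 + k_2 + k_3 + k_4 = 0$ and the oddness of $f$ (so that $f_{12} = -f_{34}$); nothing beyond what is stated above is assumed.

```latex
% Identity: substitute k_1 = -k_{234} = -(k_2 + k_3 + k_4).
\begin{align*}
k_3k_4 - k_1k_2 &= k_3k_4 + (k_2+k_3+k_4)\,k_2 \\
                &= k_2^2 + k_2k_3 + k_2k_4 + k_3k_4 \\
                &= (k_2+k_3)(k_2+k_4) = k_{23}k_{24}.
\end{align*}
% Regrouping: write k_3k_4 = k_1k_2 + k_{23}k_{24} and f_{12} = -f_{34}.
\begin{align*}
&(f_1+f_2+f_{34})\,k_3k_4 + (f_3+f_4+f_{12})\,k_1k_2 \\
&\quad= (f_1+f_2+f_{34})\,k_{23}k_{24}
   + \bigl[(f_1+f_2+f_{34}) + (f_3+f_4-f_{34})\bigr]k_1k_2 \\
&\quad= (f_1+f_2+f_{34})\,k_{23}k_{24} + (f_1+f_2+f_3+f_4)\,k_1k_2.
\end{align*}
```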

Since $k_{12} + k_{13} + k_{23} = -2k_4$, we see that at least one of $N_{12}, N_{13}, N_{23}$ is comparable to $N_1$. Without loss of generality we may take $N_{23} \sim N_1$. We now write the left-hand side as

\[ f(k_1) - f(k_1 - k_{12}) - f(k_1 - k_{13}) + f(k_1 - k_{12} - k_{13}) \]

and use the double mean-value theorem (see footnote 28; note that $f'' = O(N_1^{-1})$ here) to conclude the argument. □
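The double mean-value theorem invoked in the last step can be made explicit. The following is a sketch, assuming that $f$ is twice continuously differentiable on the interval in question (after the smoothing described in footnote 28); the abbreviations $a := k_{12}$ and $b := k_{13}$ are introduced here only for readability.

```latex
\begin{align*}
f(k_1) - f(k_1-a) - f(k_1-b) + f(k_1-a-b)
  &= \int_0^a\!\!\int_0^b f''(k_1 - s - t)\,dt\,ds, \\
\bigl|f(k_1) - f(k_1-a) - f(k_1-b) + f(k_1-a-b)\bigr|
  &\le |a|\,|b|\,\sup|f''| \le N_{12}\,N_{13}\,\sup|f''|.
\end{align*}
% Index bookkeeping, using k_1 + k_2 + k_3 + k_4 = 0 and f odd:
%   k_1 - k_{12} = -k_2,   k_1 - k_{13} = -k_3,   k_1 - k_{12} - k_{13} = k_4,
% so the left-hand side is exactly f_1 + f_2 + f_3 + f_4.
```

Here the supremum is taken over the points between $k_1$ and $k_1 - a - b$, all of which have magnitude $O(N_1)$, so the size of $f''$ on that interval is what controls $f_1 + f_2 + f_3 + f_4$.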

References 

[1] J. Bourgain, Fourier restriction phenomena for certain lattice subsets and applications to nonlinear evolution equations, Part II, Geometric and Funct. Anal. 3 (1993), 209-262.
[2] J. Bourgain, Aspects of longtime behaviour of solutions of nonlinear Hamiltonian evolution equations, GAFA 5 (1995), 105-140.
[3] J. Bourgain, Approximation of solutions of the cubic nonlinear Schrödinger equations by finite-dimensional equations and nonsqueezing properties, Int. Math. Res. Notices, 1994, no. 2, (1994), 79-90.
[4] J. Bourgain, Periodic Korteweg de Vries equation with measures as initial data, Selecta Math. (N.S.) 3 (1997), 115-159.
[5] J. Bourgain, Global solutions of nonlinear Schrödinger equations, AMS Publications, 1999.
[6] M. Christ, J. Colliander, T. Tao, Illposedness for canonical defocussing equations below the endpoint regularity, to appear.
[7] J. Colliander, M. Keel, G. Staffilani, H. Takaoka, T. Tao, Global well-posedness result for KdV in Sobolev spaces of negative index, Elec. J. Diff. Eq. 2001 (2001) No 26, 1-7.
[8] J. Colliander, M. Keel, G. Staffilani, H. Takaoka, T. Tao, Sharp global well-posedness for periodic and non-periodic KdV and mKdV on R and T, J. Amer. Math. Soc. 16 (2003), 705-749.
[9] J. Colliander, M. Keel, G. Staffilani, H. Takaoka, T. Tao, Multilinear estimates for periodic KdV equations, and applications, Journ. Funct. Analy. 211 (2004), no. 1, 173-218.
[10] J. Colliander, G. Staffilani, H. Takaoka, Global well-posedness of the KdV equation below $L^2$, Math Res. Letters 6 (1999), 755-778.
[11] L. Dickey, Soliton equations and Hamiltonian systems, World Scientific, 1991.
[12] C.S. Gardner, Korteweg-de Vries equation and generalizations IV, J. Math. Phys. 12 (1971), no. 8, 1548-1551.
[13] M. Gromov, Pseudo-holomorphic curves in symplectic manifolds, Invent. Math. 82 (1985), 307-347.
[14] H. Hofer, E. Zehnder, A new capacity for symplectic manifolds, in Analysis et cetera, Academic Press (1990), 405-428. Edited by P. Rabinowitz and E. Zehnder.

Footnote 28. One could object that $f''$ is much larger than $N_1^{-1}$ near the origin. However, since we are only evaluating $f$ at points in the annulus $\{k : |k| \sim N_1\}$, we can smooth out $f$ inside this annulus so that $f'' = O(N_1^{-1})$ throughout the interval $\{k : |k| \lesssim N_1\}$ without affecting the left-hand side.






[15] H. Hofer, E. Zehnder, Symplectic Invariants and Hamiltonian Dynamics, Birkhäuser Verlag, 1994.
[16] T. Kappeler, P. Topalov, Global well-posedness of KdV in $H^{-1}(\mathbb{T}, \mathbb{R})$, preprint 2003.
[17] T. Kappeler, P. Topalov, Global fold structure of the Miura map on $L^2$, IMRN 2004:39 (2004), 2039-2068.
[18] C. Kenig, G. Ponce, L. Vega, A bilinear estimate with applications to the KdV equation, J. Amer. Math. Soc. 9 (1996), 573-603.
[19] C. Kenig, G. Ponce, L. Vega, On the ill-posedness of some canonical dispersive equations, Duke Math. J. 106 (2001), no. 3, 617-633.
[20] C. Kenig, G. Ponce, L. Vega, The Cauchy problem for the Korteweg-de Vries equation in Sobolev spaces of negative indices, Duke Math. J. 71 (1993), 1-21.
[21] S. Kuksin, Infinite-dimensional symplectic capacities and a squeezing theorem for Hamiltonian PDE's, CMP 167 (1995), 521-552.
[22] S. Kuksin, Analysis of Hamiltonian PDEs, Oxford Lecture Series in Mathematics and Its Applications, 19, Oxford Univ. Press, 2000.
[23] F. Magri, A simple model of the integrable Hamiltonian equation, J. Math. Phys. 19 (1978), no. 5, 1156-1162.
[24] R. Miura, Korteweg-de Vries equation and generalizations. I. A remarkable explicit nonlinear transformation, J. Mathematical Phys. 9 (1968), 1202-1204.
[25] P. Olver, Applications of Lie groups to differential equations, Springer, 1997.
[26] A. Sjöberg, On the Korteweg-de Vries equation: existence and uniqueness, J. Math. Anal. Appl. 29 (1970), 569-579.
[27] H. Takaoka, Y. Tsutsumi, Well-posedness of the Cauchy Problem for the modified KdV equation with periodic boundary condition, Internat. Math. Res. Notices 2004, no. 56, 3009-3040.
[28] T. Tao, Multilinear weighted convolution of $L^2$ functions, and applications to non-linear dispersive equations, Amer. J. Math. 123 (2001), 839-908.
[29] T. Tao, Global regularity of wave maps I. Small critical Sobolev norm in high dimension, IMRN 7 (2001), 299-328.
[30] T. Tao, Global regularity of wave maps II. Small energy in two dimensions, Comm. Math. Phys. 224 (2001), 443-544.
[31] V.E. Zakharov, L.D. Faddeev, The Korteweg-de Vries equation is a completely integrable Hamiltonian System, Funkz. Anal. Priloz. 5 (1971), no. 4, 18-27.



University of Toronto 



University of Minnesota 



M.I.T. 



Kobe University and the University of Chicago 



University of California, Los Angeles 



